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Preface 


This volume provides an introduction to the physics of beams. This field 
touches many other areas of physics, engineering and the sciences, and in 
turn benefits from numerous techniques also used in other disciplines. In 
general terms, beams describe ensembles of particles with initial conditions 
similar enough to be treated together as a group, so that the motion is a 
weakly nonlinear perturbation of that of a chosen reference particle. 

Applications of particle beams are very wide, including electron micro- 
scopes, particle spectrometers, medical irradiation facilities, powerful light 
sources, astrophysics — to name a few — and reach all the way to the largest 
scientific instruments built by man, namely, large colliders like LHC at CERN. 

The text is based on lectures given at Michigan State University's Depart- 
ment of Physics and Astronomy, the online VUBeam program, the US Particle 
Accelerator School, the CERN Academic Training Programme, and various 
other venues. Selected additional material is included to round out the pre- 
sentation and cover other significant topics. 

'The material is at a level to be accessible to students of physics, mathemat- 
ics and engineering at the beginning graduate or upper division undergraduate 
level and can be viewed as an introductory companion to the more advanced 
Modern Map Methods in Particle Beam Physics by M. B., published by Aca- 
demic Press. Emphasis has been placed on showing major concepts in their 
original incarnations and through historic figures. Finally, some of the sec- 
tions and chapters that contain more advanced material are marked by a * 
symbol and can be omitted in a first reading. 

Many organizations and individuals have helped directly and indirectly at 
various stages in the development of this book. MSU's Physics and Astronomy 
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For two decades of continuous financial support that were instrumental 
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research, we are grateful to the US Department of Energy, and in particular to 
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K. M. would like to thank her daughter Kazuko for her own great interest 
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text. 
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Forest for stimulating discussions on various aspects of beam dynamics such 
as normal form theory, and his wife Juxiang Teng for her unwavering support 


xiii 


xiv Preface 


throughout this project. 

All of us want to thank Béla Erdélyi, Gabi Weizman, Pavel Snopok and 
He Zhang for thoughtful comments about the material. We also are thankful 
to many authors, national laboratories and publishers allowing us to repro- 
duce published figures. The details are described in the corresponding figure 
captions. 

Last but not least, we are very grateful to the entire staff of Taylor & 
Francis for their continuous support, in particular to Francesca McGowan for 
her great interest and productive comments. 


Martin Berz 
Kyoko Makino 
Weishi Wan 


Chapter 1 


Beams and Beam Physics 


In this chapter we will lay the foundations of basic concepts about beam 
physics, and discuss various important mechanisms of production and accel- 
eration of beams. Because of the breadth of the material and the multitude 
of existing devices for each of the mechanisms, we will focus only on key con- 
cepts, and introduce them through the eyes of their inventors by using their 
original historical drawings, with only minor adjustments for uniformity of 
style and technical clarity. 


1.1 What Is Beam Physics? 


'The field of beam physics deals with motion of ensembles of particles 
(usually charged) in electromagnetic fields. It is called beam physics due 
to the fact that, in most cases, those particles have similar coordinates, 
which is the rough definition of a beam. In many cases, the positions and 
momenta of the particles are sufficient to describe their motion. In this 
case, the particles are described by a state vector consisting of positions and 
momenta 


Z = (£, Pr, Y, Py, Z, Pz) < 


In other cases, additional coordinates may be needed; typical examples in- 
clude the mass, sometimes the charge, or the spin vector and the related 
magnetic moment and possibly electric moment of the particle. 

An ensemble of particles with such similar coordinates is called a beam 
(see Fig. 1.1), and the sub-fields concerned with the study of such beams is 
called beam physics. There are other fields of physics that can be described 
in very similar terms and language, some of the most notable examples being 
light optics and astrodynamics. There are also other sub-fields of physics 
dealing with the study of the motion of such ensembles of particles; important 
examples are plasma physics and the dynamics of galaxies. These fields 
are different from beam physics in that in their cases, the particles usually do 
not have rather similar coordinates but occupy larger regions. 

The space of state vectors Z is often called phase space, and a coordinate 
system showing Z is often called a phase space diagram. The volume of 
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FIGURE 1.1: A beam — an ensemble of particles in the vicinity of a 
reference particle with phase space coordinate Zo. 


the cloud of particles in phase space has a special name. It is called emit- 
tance. As we shall see later, in many systems the emittance is conserved and 
hence plays a special role. 

Because all particles are close together, it is often useful to pick one of 
these particles, typically one that is somewhere in the middle, and describe 
the motion of the others relative to this reference particle. So if the 
reference particle has coordinates Žo, then the motion of the particles would 
be described in the relative coordinates AZ = Z — Žo. 

In many cases, the density of particles is so low that their interaction can 
be neglected or expressed by simple collective models. In other cases, it is 
necessary to include the study of the self-interaction, i.e., we have to take into 
account the fields due to the space charge. 

If the fields are electromagnetic, then the motion is described by the Lorentz 
force law, which in SI units is 


Z q(E+ox B). (1.1) 


Here É and B are the electric and magnetic fields, respectively. These fields 
are connected to the scalar potential V and the vector potential A via the 
relations 


—-Vx4A, E=-—-VWV. 


Although this may not be directly relevant in this book, we want to note here 
for the sake of completeness that the equations of motion in the form of the 
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Lorentz force law can also be obtained from the Lagrangian 


TR UNE LANE Ü.Á—qV; (1.2) 
= c2 q q , 2 


refer to eqs. (1.85), (1.145) in [5], and, for example, [29]. From this La- 
grangian, one can also obtain a Hamiltonian of the motion in a procedure 
that is standard for all Lagrangian systems. One begins by defining the canon- 
ical momentum as: 


„oL 
Pcan = OB 
which here has the form 
Dean = ym + qÁ = Dayn + qÅ, (1.3) 
where 
1 


1= RES 


and the canonical momentum Pean is different from the relativistic dynamical 
momentum 
Payn = YM. (1.4) 


The Hamiltonian of the motion can then be found as 
H = Pcan :U — L. 


This expression initially contains both Pean and V, and it is necessary to elim- 
inate y and express it in terms of Pean. Because Payn = mv and pays = 


Dean a qA from eq. (1.3), ym = mu/ V F= geer = Pean = gA, leading to 
mo? /(1 = vc?) = (Pean — qgA)?, so we find 


2 g 5 
=e n n l, (1.5) 

E E, 202 
where the expression in terms of pay; is listed as well. Using this, \/1 — v?/c? 


in the Lagrangian L in eq. (1.2) is expressed in terms of Pean, also in terms 
of Dayn, as 


=4/1- — — Eo (1.6) 
(Bean = oA) + m2e2 NI TAE NU EET 


'Thus, we obtain for the Hamiltonian 


2 
H =c. (Bean m A) + m?c? + qV; 
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refer to eq. (1.149) in [5] for details of the derivation. 


When studying the evolution of the beam from the time it is born until it is 
used, there are usually four steps involved. First, there must be a way for the 
production of the beam, and for the sake of efficiency if possible in such a 
way that its emittance is small. Second, in most cases the energy of the beam 
has to be increased; there has to be a mechanism of acceleration. Because 
of the outstanding importance of this process, the whole field is often called 
accelerator physics. Then it is necessary to transport the beam to where 
it is being used. And finally, there is often a need for storage of the beam 
for use at a later time or reuse. Lastly, often there is a need for analysis of 
the beam, in particular after the beam has been used for its purpose, which 
frequently is the facilitation of certain nuclear or high energy reactions. The 
field of beam physics spans all these steps, and each of the steps has it own 
unique problems to be solved. 


1.2 Production of Beams 


'The mechanisms used for the production of the beam depend very much 
on the particular kind of particles and the characteristics of the beam that 
is needed, and they include mechanisms from a variety of different fields in- 
cluding thermal, electrical, atomic, nuclear, and even high energy physics 
processes. Common beams consist of electrons, protons, or H^, and some 
of the beams produced through nuclear and high energy physics processes in- 
clude positrons, antiprotons, pions, kaons and radioactive nuclei. Overall, due 
to the diversity of the species of the particles and the required properties of 
the beam, there are dozens of different ways of producing various beams. We 
here restrict ourselves to some of the source types that are most commonly 
used in particle accelerators and electron microscopes. 


1.2.4 Electron Sources 


Electrons exist in abundance in metals, and forming them into beams re- 
quires their extraction from the metal, called the cathode. For this the 
electrons need to overcome the potential barrier, i.e., the work function, 
at the boundary between the metal and the environment. The work func- 
tion usually ranges from a fraction of an electron Volt (eV) to a few electron 
Volts; for comparison, the average kinetic energy of gas molecules at room 
temperature amounts to approximately 1/40 eV. This can be achieved by ei- 
ther supplying additional energy to the electrons so that they can leave the 
material, or by lowering the work function. In the following we discuss some 
common approaches based on these methods. 
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FIGURE 1.2: Sketch of an early thermionic emission electron source. 
(Reprinted with permission from J. R. Pierce, J. of Appl. Phys., 11:548, 
1940 [57]. Copyright 1940, AIP Publishing LLC.) 


The first of these processes is thermionic emission. By heating a piece 
of metal to temperatures exceeding around 1000°C, a small fraction of the 
electrons will achieve energies exceeding the work function and can thus leave 
the metal. This type of source is usually called the thermionic gun. Once 
outside the metal, the electrons can be pulled away further by the application 
of strong electric fields, the distribution of which is adjusted to achieve high 
gradient and optimal focusing. An example of such a device is shown Fig. 1.2. 
Here the number of electrons available is determined by the temperature of 
the donor metal or cathode, and only those electrons in the tail of the Fermi- 
Dirac distribution above the work function can be extracted. This process is 
quantitatively described by the Richardson-Dushman equation 


4 k? 
J= ( rem 2) T2e-W/kaT. (1.7) 


where J is the current density, e is the charge, m is the mass, kg is the 
Boltzmann constant, h is the Planck constant and T'is the temperature of 
the cathode. It is obtained from the third law of thermodynamics and char- 
acterizes an idealized situation of a sufficiently large piece of cathode material 
to avoid quantum mechanical influences, and the absence of electric fields 
influencing extraction. 

In practice the extracted current is also greatly affected by any electric field 
applied to the cathode. This is the result of the Coulomb repulsion among the 
extracted electrons, where electrons extracted earlier can push those extracted 
later back into the cathode. When the electric field at the surface vanishes, no 
more electrons will be extracted. The relation between the maximum current 
density and the applied electric field, for a parallel flat cathode and a matching 
anode, is the Child-Langmuir Law 


4. Jos ye y 
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FIGURE 1.3: Left: Sketch of one of the earliest electron sources using 
point cathodes. (From Y. Sasaki and S. Maruse, Uber die Arbeitsweise und 
die elektronenoptischen Eigenschaften der Spitzenkathode, in G. Móllenstedt, 
H. Niehrs, and E. Ruska, eds., Physikalisch- Technischer Teil, 1:9, Springer- 
Verlag, 1960, © Springer-Verlag Berlin Heidelberg 1960 [61]. Abb. 3, “Zwei 
Anordnungen der Elektrodensysteme für die Spitzenkathode." With kind per- 
mission from Springer Science and Business Media.) Right: The potential 
(dashed) and the field (solid) distribution near the cathode tip. 


where J is the current density, eo is the dielectric constant in vacuum, e is the 
charge, m is the mass, Vo is the applied voltage between the cathode and the 
anode, and d is the distance between the cathode and the anode. In practice 
the situation is more involved, and the maximum current density is usually 
the smaller of the two quantities. For flat thermionic cathodes, usually eq. 
(1.7) sets the limit for the extracted current. 

Due to the high operating temperature, the energy spread of the extracted 
electron beam is relatively large. Nonetheless, the thermionic gun is simple 
and reliable, and hence is still widely used as the source for many devices 
where the large energy spread of the electrons at the cathode is not limiting 
the performance of the machine. One significant example are circular electron 
accelerators, where the ultimate energy spread in the beam is dominated by 
other processes including synchrotron radiation discussed later. 

'The second process to produce electrons is field emission. In this mech- 
anism, a sharp needle is brought into strong external electric fields. This 
type of source is usually called the field emission gun. Because the needle 
is a conductor, it acts as an equipotential surface, and thus produces very 
strong electric fields near its tip. In practice the radius of curvature at the tip 
often ranges from below 1 nm to nearly the range of single atoms to 1 um, 
and one locally obtains a very strong field ranging from 1 to 3 GV/m. These 
strong electric fields acting near the surface lower the work function and si- 
multaneously reduce the width of the potential barrier, which allows electrons 
to escape through the well through quantum tunneling. All these electrons 
emerge from a small area. Furthermore, due to the fact that the electrons 
are diverging near the surface of the needle, the actual source is quite a bit 
smaller than the emitting area. Meanwhile, for low current, their spread in 
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momenta is rather small as well. Fig. 1.3 shows the basic principle. 

The small emitting area imposes a severe limit on the total current ex- 
tracted, but it is the standard electron source for transmission electron mi- 
croscopes discussed in more detail below. Here the current need is low, but 
the origination from a small area and with small energy spread amounting to 
what is called high brightness is very useful. The use of sharp needles (point 
filaments as they were historically called) was pioneered in the 1950s in order 
to increase brightness of the electron beam in microscopes. 

Particularly fruitful is combining needle geometries, resulting in an effective 
lowering of the work function due to stronger electric field at surface, with 
heating, which results in an increase of electrons of higher energy than the 
work function that can thus traverse it. This kind of thermionic emission with 
significant external field is called Schottky emission. In the 1960s, a new 
generation of cathodes (mainly ZrO/W, which is a tungsten tip covered with a 
thin layer of ZrO) were developed with stronger field at the surface, where field 
emission plays a significant role and complements Schottky emission. This 
regime is called the extended Schottky emission. This kind of emitters 
are the main sources of electrons for electron microscopes. The emitters that 
produce electrons through only the field emission process, called cold field 
emission gun (CFEG), have been studied since the 1970s, and recently 
have been able to produce high brightness beams. 

'The third process to produce electrons is photoemission where electrons 
are produced via the photo effect. Exposing a surface to a large flux of 
photons leads to some of the photons being absorbed by electrons within the 
material, which consequently increase their kinetic energy. This additional 
kinetic energy acts very similar to intense local heating in the thermionic gun 
and leads to electrons with energies exceeding the work function and which 
can consequentially escape. In certain cases photo-energized electrons can also 
leave the material directly without colliding with other electrons, in a ballistic 
process. In practice, photons are supplied through laser pulses, of which the 
intensity, spot size and duration are relatively easy to adjust. This leads to 
the photocathode gun. This approach has greatly facilitated the advance 
of free electron lasers (FELs) in the last few decades and is an important 
component in efforts of time-resolved spectroscopy and microscopy. 

Again the extracted current is limited by the Child-Langmuir law (1.8), 
despite the fact that an intense laser pulse can often produce large numbers of 
electrons. Different classes of materials have been developed for the cathode, 
including GaAs that has been cesiated, i.e., covered by less than a mono-layer 
of cesium. This has allowed the production of electrons with energy spreads 
down to fractions of 107! eV, which is the range of thermodynamic energies 
encountered at room temperature. 

Other materials such as copper are able to withstanding the harsh environ- 
ment of a radio frequency (RF) gun, where very high extraction fields can 
be produced that significantly exceed those of the electrostatic case. Fig. 1.4 
shows the layout of an RF gun and the field distribution. It consists of roughly 
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FIGURE 1.4: Layout (left) and field distribution (right) of the RF gun at 
the Linac Coherent Light Source (LCLS). (From J. Arthur, et al., SLAC-R- 
593, 2002 [2]. Courtesy SLAC National Accelerator Laboratory.) 


1.5 cells (1.6 for this example) with the field in the two cells in opposite phase 
to ensure that electrons emitted into high extracting field end up with high 
energy and low emittance at the exit. In addition, materials such as cesiated 
GaAs with strained lattice can also produce polarized electrons, which has 
been used at SLAC National Accelerator Laboratory, California, USA, for 
high energy physics experiments on the one hand and in spin polarized low 
energy electron microscope (LEEM) on the other. 


1.2.2 Proton Sources 


In most proton accelerators, the protons are produced through stripping 
the two electrons in a negative hydrogen ion H^ at the entrance, mostly by 
passing the ion through a thin carbon foil, although lasers have also been ex- 
perimented with recently. In some applications, such as medical accelerators, 
Hj ions instead are produced and accelerated. 

Although the stripping process makes the technology a little more com- 
plicated, it has one distinct advantage over injecting protons directly. For 
injection into circular accelerators over many turns, the protons generated 
through stripping can easily be aligned with earlier produced protons already 
in the accelerator, because before stripping, both types of particles follow dif- 
ferent paths due to the differing charge. As a result, the density of protons in 
the beam can increase more and more over an extended injection period. On 
the other hand, protons injected directly require much more care and can only 
be injected with positions and directions that are different from those of other 
protons already in the ring, since because of time reversal, all protons with the 
same position and direction must have followed the same earlier trajectory. 

The H7 ions can be produced by a variety of different methods. Here, only 
the surface plasma source of the magnetron type is discussed. It obtained 
its name due to the fact that it is similar in configuration to the ubiquitous 


Beams and Beam Physics 9 


Q) MAGNETIC FIELD MAGNETIC POLE 


ANODE 


ANODE 
CATHODE 

ION EXTRACTOR 
GAS INLET CATHODE 


ELECTRON COLL 
ION EXTRACTOR FROR 


ELECTRON COLLECTOR MAGNETIC FIELD 
| (1-2kGauss) 


FIGURE 1.5: Drawing of a surface plasma source of the magnetron geome- 
try. (From J. Ishikawa, Negative ion sources, in I. G. Brown, ed., The Physics 
and Technology of Ion Sources, 2nd ed. [31]. Copyright (c) 2005 Wiley-VCH. 
With permission.) 


microwave source called the magnetron, which is used in common appliances 
such as the microwave oven. The main process of producing the negative 
hydrogen ions is through electron capture of neutral hydrogen atoms on the 
cathode surface, where electrons penetrate the potential barrier through quan- 
tum tunneling. The presence of the electric and magnetic field creates a dense 
plasma near the surface of the cathode where ions, most of which are positive, 
are produced. Those positive ions and neutral particles bombard the cathode 
partially covered with cesium and H7 ions are produced. The presence of ce- 
sium lowers the work function and greatly increases the probability of barrier 
penetration and hence H^ production. 

Some of the produced H^ ions are neutralized shortly after production. 
An electric field between the cathode and the anode is used to accelerate 
the remaining H^ ions towards the exit. However, electrons are accelerated 
towards the anode as well. But since electrons are much lighter than the H^ 
ions, they are bent much more easily than the H^ ions, and are absorbed by 
the electron collector (Fig. 1.5). 

The magnetron type of negative hydrogen ion source was first developed 
in the former Soviet Union in the early 1970s and quickly spread to Europe 
and the United States. It has become the main choice for high energy proton 
accelerators. 


1.2.3 Ion Sources 


There are a large variety of different sources for ions which found applica- 
tions in many fields. Presenting even a brief overview is already beyond the 
scope of this book; for more comprehensive reviews, see [77, 8]. As in the 
previous subsection, we limit our discussion to one important kind, which is 
the electron cyclotron resonance (ECR) ion source. Fig. 1.6 shows the 
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mechanism schematically. The chamber on the left holds the plasma where 
ions are produced. First, the gas of the element of interest is injected to 
the chamber. Collisions among the atom generates electrons and ions. The 
magnetic field (see top part of Fig. 1.6) forms a magnetic mirror that con- 
fines the ions and the electrons. For the example shown, this time is around 
100 us, and for more modern and advance versions for around 10 ms. High 
frequency (2.45 to 28 GHz) microwaves are injected into the chamber and 
electrons with rotation frequency matching that of the microwaves are accel- 
erated to between 1 and 20 keV. This process of heating up the electron gas is 
called electron cyclotron resonance heating. The resulting hot electrons 
collide with ions and neutral atoms and generate more ions. Furthermore, 
through step-by-step ionization, even multiply charged ions (e.g., Xe38+) can 
be produced. 

In order to produce sufficient quantities of multiply charged ions, the ion 
confinement has to relatively long (~ 10 ms). The major improvement in this 
aspect is the addition of a sextupole magnet, which ensures that the magnetic 
field at the center of the chamber is at the minimum, which prevents the ions 
from drifting to the side wall. Another consequence of this configuration of 
the magnetic field is that the surface on which electron cyclotron resonance 
heating takes place is now closed, which significantly reduces hot electron 
loss. This configuration of the magnetic field also makes ion production more 
efficient, since electrons are confined longer and can collide with ions and 
be reheated many times. The fact that the ECR ion source does not use à 
cathode to generate electrons makes it a much more reliable source compared 
to other varieties. 

'The ECR ion source was first developed in the mid-1960s and, by the mid- 
1970s, many had been built around the world. It is probably the best source to 
produce multiply charged ions and has become the main choice of ion sources 
for nuclear physics facilities. 


1.3 Acceleration of Beams 


We now assume that an ensemble of particles occupying a small volume of 
phase space has been created, and we thus have what is called a beam. In 
many if not most of the practical cases, the energy that the beam has after 
being produced by the source is not sufficient for the purpose it is to be used 
for, which frequently amounts to furnishing the energy necessary for atomic, 
nuclear, or particle processes of interest. 

In most cases, the motion is best studied by first considering the motion 
of the reference particle, and once this motion is understood satisfactorily, 
to study the relative motion of the other particles. For a simple analysis 
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FIGURE 1.6: Layout of the first electron cyclotron resonance (ECR) ion 
source that produced multiple charged ions. (Reprinted with permission from 
R. Geller, Appl. Phys. Lett., 16:401, 1970 [28]. Copyright 1970, AIP Publish- 
ing LLC.) 


of the relative motion, often a linear approximation with all the resulting 
simplifications is possible, but frequently a full understanding of the motion 
can only be achieved by considering the nonlinear effects. 

Considering the special shape of the Lorentz force law (see eq. (1.1)), 
since 7 x B is perpendicular to the velocity v, it is apparent that magnetic 
fields cannot be used for purposes of acceleration, which requires forces in 
the direction of the particle. Thus any acceleration has to be provided by 
electric fields. However, as we shall see, also magnetic fields have very good 
use in particle accelerators, as they can be employed to guide the beam to 
where it is needed. In particular, in the process of acceleration they are often 
used to guide the beam through the same region of electric field repeatedly 
and thus allow the device to maximize the use of the electric fields. Indeed, 
for this purpose of guiding the beam, magnetic fields are usually even better 
suited than electric fields. This is because for the high velocities that beams 
usually have after even modest acceleration, the forces that can be attained 
with technologically available magnetic fields far exceed those that can be 
achieved with the respective electric fields. 

Very generally, the amount of energy K a particle gains while traveling 
from time tı to time tz in an electric field E (r,t) that depends on position 
and time is given by the path integral 


3 esp. | ° (F(A), t) TE) dt, 
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FIGURE 1.7: The general principle of the Cockcroft- Walton generator. 
(From J. D. Cockcroft and E. T. S. Walton, Proc. Royal Soc. London, A, 
136:619, 1932 [14]. With permission.) 


where F(t) is the particle's position as a function of time and U(t) its velocity. 
In the special case that E is time independent and hence can be written 
in terms of a potential via E- —ŪV, this path integral reduces in a natural 
way to the difference in potential as 


K =q (VF) - V(%)). 


This simple fact implies a very important consequence for the design of electric 
accelerating fields: if there is to be any chance to utilize the same electric field 
repeatedly for the purpose of acceleration, then the electric field has to 
be time dependent, because otherwise repeated passing just results in a 
periodic increase and decrease of energy. In fact, the attempt to build an 
accelerator trying to increase energy repeatedly by flying through the same 
time independent field is tantamount to the attempt to build a perpetual 
motion machine. 


1.3.1 Electrostatic Accelerators 


The first class of important accelerators are those based on static electric 
fields. The kind that grew directly out of the area of electric circuit is the 
voltage multiplier that is now known under the name Cockcroft-Walton 
generator. It consists of a simple but clever circuit made of diodes and ca- 
pacitors, forming the voltage multiplier ladder. Each time a base voltage is 
applied, the first capacitor is filled with charge. Each time the voltage is re- 
moved, the first capacitor passes on some of its charge to the second capacitor, 
which in turn feeds the third capacitor, and so on. Depending on the number 
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FIGURE 1.8: Design sketch of the Van de Graaff high voltage generator. 
(From R. J. Van de Graaff, US Patent 1,991,236, 1931 [18].) 


of capacitors and the number of cycles applied, quite high voltages can be 
obtained very easily. 

For small applications it is possible to simply apply alternating current 
(AC) at the feeding end. For larger applications it may require a longer 
time to transfer the charges from the lower to the higher capacitors, and 
the change in input voltage is achieved through mechanical switches. Fig. 1.7 
illustrates *the general principle underlying the method adopted," and is from 
the original paper on the matter [14]. Many high energy proton accelerators 
use Cockcroft- Walton generators as the first stage of acceleration. 

Another method to obtain high voltages for electrostatic accelerators is the 
Van de Graaff accelerator [20] and several similar devices derived from it are 
the main representatives of the class of accelerators utilizing time independent 
fields. The voltage difference that the particles travel through is obtained with 
a Van de Graaff generator, which consists of an endless non conducting 
belt onto which charge is sprayed from a tip via field emission, and which is 
then transported to the inside of a hollow metal sphere where it is deposited. 

Since any charge on a conducting object accumulate on the outside and 
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FIGURE 1.9: Design sketch of the use of the Van de Graaff generator as 
a particle accelerator. (From R. J. Van de Graaff, US Patent 1,991,236, 1931 


[18].) 


create a field-free interior, a new charge can be brought in from the belt on 
the inside of the sphere without experiencing any opposing fields, and thus 
large amount of charges can be accumulated on the sphere, resulting in very 
high potentials. The mechanism of the Van de Graaff generator is shown in 
the left part of Fig. 1.8 and the complete machine is shown in Fig. 1.9. The 
sketches are from the original patent on the device [18]; see also [20]. 

In passing it is worthwhile to remark that while the newly added charge 
does not experience a field when moving from the belt to the inside of the 
sphere, it certainly experiences a field while approaching the sphere and being 
attached to the belt. Thus the potential energy contained on the charged 
sphere does not come for free. It is generated through the mechanical work 
that is necessary to move the belt and the attached charges toward the sphere. 

The charged sphere is connected to a metal enclosure containing the ion 
source, thus elevating the source to the potential of the charged sphere, which 
can then be utilized for the acceleration of the particles. 

The main practical limitation of the Van de Graaff accelerator is the ne- 
cessity to prevent sparks. This is achieved on the one hand by sheer size, 
because at the same potential difference, larger size means less electric field 
strength. On the other hand, it is important to inhibit the spark forma- 
tion process. Microscopically, sparks form in a gas when small numbers of 
charged particles have a mean free path length that is long enough so they can 
attain energies sufficient to ionize other particles upon collision, resulting in 
an avalanche. This can be avoided by choosing inert gases like He (helium) 
or SF¢ (sulfur hexafluoride), and on the other hand applying high pressure to 
reduce the mean free path length. 
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FIGURE 1.10: The principle of the tandem Van de Graaff accelerator. 
(Reprinted from Nucl. Instrum. Methods, v. 8, R. J. Van de Graaff, Tandem 
electrostatic accelerators, p. 195-202, Copyright (1960), with permission from 
Elsevier [19].) 


'The Van de Graaff accelerator has several desirable features; for example, it 
can produce a fully continuous beam (often denoted by the term “cw” for con- 
tinuous wave) and at high beam current. Its main limitation is the relatively 
low energies that it can produce, which seldom exceed about 20 MeV. 

The tandem Van de Graaff is an efficient modification of the Van de Graaff 
concept, in which both the source and the target are kept at ground potential 
and which can efficiently increase the energy that can be obtained. For 
this purpose, a source is chosen that produces negatively charged ions, which 
are then sent through a regular Van de Graaff. At the end of the accelerat- 
ing section, the ions are sent through a thin foil, in which many of them are 
stripped of some of their electrons, resulting in positive ions. Because the 
particles already have substantial energy when hitting the foil, often much 
higher charge states can be produced than in the ion source itself. These 
positive ions are then sent through a second stage Van de Graaff, which is 
essentially a reversion of the first stage. At the location of the target, depend- 
ing on their charge state after stripping, their energy is increased by a factor 
of two or more. The mechanism of the tandem Van de Graaff accelerator is 
shown in Fig. 1.10. Having very similar characteristics to the original Van de 
Graaff, the energies that can be achieved in this way are in the range of up 
to 60 MeV. 


1.3.2 Linear Accelerators 


It is an important observation that the field strength that can be obtained 
in quickly oscillating (radio frequency, RF) electric fields can be sub- 
stantially higher than what can be obtained statically in devices of similar 
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FIGURE 1.11: Sketch of the principle of the linear accelerator of the 
Wideroe type. (From R. Wideróe, Über ein neues Prinzip zur Herstellung ho- 
her Spannungen, Archiv für Elektrotechnik, 21:387, 1928, © Springer-Verlag 
Berlin Heidelberg 1928 [73]. Bild 1, “Prinzip der Spannungstransformation 
mit Potentialfeldern." With kind permission from Springer Science and Busi- 
ness Media.) 


size. This is mostly due to reduced presence of spark formation, because the 
formation of an avalanche of charged particles requires time scales that are 
usually larger than the time the field is in one phase. 

'The use of an oscillating field, however, immediately entails that only half 
of the cycle can be used for acceleration, and thus is different from static ac- 
celerators, as the resulting beams always have a temporal micro-structure, 
also called bunched. In practical use, usually several RF resonators are used 
sequentially, each one of which accelerating the particles, and it is very impor- 
tant that the phase relationship between the individual accelerating sections 
is correct. This is usually achieved by applying the fields between the edges 
of adjacent conducting tubes. This kind of device is called the linear accel- 
erator or linac. 

'The concept of the earliest linacs is schematically shown in Figs. 1.11 and 
1.12. From the wiring scheme, it is clear that the electric field in adjacent 
gaps points to opposite directions. In order to ensure that charged particles 
are accelerated in every gap, the lengths of the tubes are chosen in such a 
way that the time the particles require to fly through them equals one half of 
the RF period. So the length L; of the ith tube has to be chosen so that it 
satisfies 


1 
Li = 3"ihit 
where T,¢ is the period of the RF frequency. Apparently this leads to a system 
of tubes of increasing length, i.e., Lj < Lə < L3 < .... The exact lengths 


Li, of course, depend on the relationship between the kinds of particles and 
the values of the accelerating voltages, and so often these designs are rather 
customized geometries. Since metal wires are used to connect the tubes, the 
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FIGURE 1.12: Illustration of a linear accelerator, designed by E. O. 
Lawrence and H. D. Sloan. (Reprinted the middle picture of Fig. 1 with 
permission from [64] as follows: D. H. Sloan and E. O. Lawrence, Phys. Rev., 
38, 2021, 1931. Copyright (1931) by the American Physical Society.) 


frequency of the oscillating field is limited to below 100 MHz due to increased 
radiation at high frequency. This in turn imposes an upper limit on the 
velocity of the particles (8 = v/c « 0.03) due to practical limit of the length 
of the tubes. Meanwhile, use of wires at low frequency can significantly reduce 
the size of the accelerating structure compared to a closed structure, called 
RF cavity. 

Another type of linac, called the Alvarez linac, developed in the late 1940s 
is shown in Fig. 1.13. It uses closed structures, the RF cavities, to increase the 
oscillating frequency and reduce the length of the tubes. Another difference 
is that the field in adjacent gaps points to the same direction. As a result, 
the length L; of the ith tube has to be chosen so that it satisfies 


Li = Vy. 


Apparently, the frequency has to be at least twice of the Wideróe type to 
reduce the tube length. In practice, the frequency of the Alvarez linac is 
roughly an order of magnitude higher than that of the Wideróe type. 

Fig. 1.13 is remarkably informative of the physics of the accelerator. The 
following sentences are excerpts from the original US patent [1]. The left top 
picture is “a diagram showing a normal cylindrical wave guide and the axial 
electric field distribution therein." The left bottom picture is “a diagram 
showing a wave guide and the electric fields when a series of graded drift 
tubes are placed therein." The right top picture is “a diagram representing 
the voltage existing across the gaps of the drift tubes." The left picture of 
the right bottom corner is “a diagrammatic longitudinal sectional view of 
drift tube ends with the electric field distribution existing across the gap." 
The right picture of the right bottom corner is “a diagrammatic longitudinal 
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FIGURE 1.13: Sketches illustrating the basic principles of the linear ac- 
celerator of the Alvarez type. (From L. W. Alvarez, US Patent 2,545,595, 
1947 [1].) 


FIGURE 1.14: The structure of the RFQ, the radio-frequency quadrupole 
linear accelerator. The picture shows an early example at Los Alamos Na- 
tional Laboratory, New Mexico, USA. (From K. R. Crandall, et. al., in R. L. 
Witkover, ed., Proc. 1979 Linac Conf., BNL-51134, 1979 [17]. Courtesy 
Brookhaven National Laboratory.) 


section view of drift tube ends with a focusing foil attached to one tube, and 
the resulting electric field distribution.” For details regarding phase stability 
(the right top picture), see Section 10.2. For details regarding transverse 
focusing and defocusing, see Section 10.4. 

An interesting combination of the need for bunching, accelerating and 
focusing (which is discussed later in detail) is the radio frequency quadrupole 
(RFQ) accelerator. Developed in the late 1960s in the former Soviet Union, 
RFQs have been widely adopted as injectors for proton and ion accelerators. 
Fig. 1.14 shows the structure of a RFQ accelerator. The four vanes break 
the rotational symmetry and produce an electrostatic quadrupole field that 
oscillates with time. The traveling particles feel the quadrupole field that 
changes polarity with time and are focused in both transverse planes. The 
longitudinal electric field that accelerates the charged particles is produced 
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through modulation of the vanes. Similar to the drift tubes, the particles are 
accelerated throughout the structure when the distance between the peak and 
the neighboring valley satisfies 


Li = sue 

In general, linacs can provide beams of high current, and of higher en- 
ergies than static accelerators, yet because of the single use of each electric 
field, they are still rather expensive per MeV. Linacs are frequently used as 
pre-accelerators for accelerators of higher energies. They also have the dis- 
tinctive advantage that they avoid synchrotron radiation, which is often 
a limiting factor in circular accelerators for light particles. This aspect is 
very important for electron and positron high energy accelerators such as the 
Stanford Linear Collider (SLC) at SLAC National Accelerator Laboratory, 
California, USA. It is the main reason for the interest in next generation 
Linear Colliders, such as plans being considered for an International Linear 
Collider (ILC), where a pair of two linacs shoot electrons and positrons at 
each other at very high energy. 

Recently, linear accelerators have been widely used in producing a free 
electron laser (FEL), whose high peak brightness and short pulse duration 
has opened up unprecedented opportunities for scientific investigations. Fig. 
1.15 shows the setup of the first FEL experiment. Electrons go through a 
magnetic device called the undulator, which consists of alternating magnetic 
poles. As a result, the trajectory of such an electron is very similar to a sine 
function, causing the emitted photon field to add coherently. Together with 
the large number of periods, the peak intensity of the X-ray can be orders 
of magnitude higher than that from a circular accelerator (see the following 
subsection). The advantage of a linac is that it can produce an electron beam 
with smaller emittance and shorter pulse duration. 


1.3.3 Circular Accelerators 


Arguably the simplest circular accelerator is the betatron, which, besides 
its practical use as a compact accelerator for lower energies, also represents 
an excellent textbook style application of principles of electrodynamics. In 
the case of the betatron, the orbit follows a circular shape, which is achieved 
by a magnetic field. If the motion is perpendicular to the magnetic field, then 
we have in SI units 

mv? B , mu p 
—— — quB, andso p= — = —, 
p qB qB 
and so the radius of motion depends only on the momentum and charge of 
the particle as well as the magnetic field. Note that the equation is correct 
even in the relativistic case, if m is understood to mean the relativistic mass 


20 An Introduction to Beam Physics 


electron interaction region maintained 
gun at ground potential collector 


low voltage 
supply 


high voltage 
electrostatic 
generator 


FIGURE 1.15: Sketch of the first Free Electron Laser (FEL). (Reprinted 
with permission from J. M. J. Madey, J. Appl. Phys., 42:1906, 1971 [47]. 
Copyright 1971, AIP Publishing LLC.) 


m = y: mo. Commonly the ratio of momentum and charge p/q is denoted by 
Xm and called magnetic rigidity; we apparently have 
Xm — P Bp. 
q 
Because x4, = Bp, the magnetic rigidity has the unit Tesla meter (Tm), and 
is frequently simply referred to as B rho. 

In the case of the betatron, both bending and acceleration come from the 
same source, namely a magnetic field the strength of which increases with 
time in such a way that its magnitude matches the increasing energy of the 
particles to keep them at nearly constant radius, and the circular induced 
electric field provides the acceleration for the particles. Fig. 1.16 shows a 
sketch of the betatron [37, 38]. 

It is worthwhile to note that the basic idea of utilizing an electric field 
produced by a changing magnetic field also occurs in an application from 
daily life: certain modern cooking surfaces. In this case, the electrons that 
are accelerated are not within the vacuum of a beam pipe, but merely in the 
metal that constitutes the bottom of the pot used for cooking; and of course 
since their mean free path is short, they do not attain high energies before 
colliding with either other electrons or the lattice atoms, thus transferring 
their whole kinetic energy to heat. 

A quantitative understanding begins with Faraday’s law of induction, now 
one of Maxwell's equations: 


OB 


Vx E=-— 
3 Ot’ 
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FIGURE 1.16: Illustration of the magnet of a betatron, from the original 
first paper on the subject. (Reprinted Fig. 2 with permission from [37] as 
follows: D. W. Kerst, Phys. Rev., 60, 47, 1941. Copyright (1941) by the 
American Physical Society.) 


and its integral form over a surface A with the bounding C is 


f E=- [ Saas 


POR EE S 
Here we restrict our interest to circular orbits with a radius r, and the sur- 


face A is the inside of the circle. Building the magnet rotationally symmetric 
entails a rotational symmetry of the fields, which simplifies the situation to 


1 d$ — 1 „dB  rdB 


E, = —— — = —— —— — — — 
A 2mr dt Inr dt 2 dt’ 


where B is the average magnetic field enclosed by the orbit. Thus, by denoting 
the strength of E; simply by E, 


and below we denote |B| by B for simplicity. Thus we obtain for the momen- 
tum p — mv > " 
rdB B 
= =qE=q-— > m = qr. 
ap te p TURIS 
On the other hand, it is necessary that the centrifugal force on the orbit with 
radius r is compensated by the Lorentz force at that radius, which requires 


mv? 


— = quB (r) > mv = qrB (r). 
* 
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Thus, altogether we obtain the following relationship between the field B(r) 
at the orbit r and the average field: 


'This equation of central importance is often called the betatron condition. 
It requires a magnetic field that is stronger in the center than where the 
particles move, which can be achieved by suitably shaping the poles of the 
magnet. 

In principle the temporal behavior of B is irrelevant, and in practice one 
usually tries to ramp it quickly, because the pulsed beam is only available 
at the end of ramping. This is usually achieved by making the magnet part of 
an LC circuit (a resonant circuit, consisting of an inductor L and a capacitor 
C), which also conveniently allows the device to recover the energy stored in 
the magnetic field for the next ramping. For the practical use, it is important 
to try to limit Eddy currents in the iron of the magnets, and in order to 
maintain the condition B (r) = B/2, it is important to control saturation 
effects that may occur at any edges of the magnet. 

'The transverse confinement of the beam in the betatron is achieved through 
the inhomogeneity of the outer field, through effects that will be studied in 
subsequent chapters. The practical use of betatrons is nowadays mostly for 
electrons, where energies of about 300 MeV have been achieved; for protons, 
the values are about 50 MeV. 

Also in the microtron, which was invented by V. Veksler [69], a magnet 
is used to bend the particles to let them pass through the same source of 
electric field repeatedly. Different from the betatron, the emphasis here lies 
on the production of a continuous beam. Since this requires that the whole 
acceleration process must be independent of the specific time of injection, 
this entails that the magnetic field is constant in time. Thus an external 
voltage source is needed; as discussed above, if it is to be used repeatedly, 
it has to be a time dependent source, and in practice it is chosen to be an 
RF (radio frequency) cavity. Altogether, the motion follows a sequence of 
tangential circles of increasing radius that touch at the location of the RF 
cavity, as shown in Fig. 1.17. 

In order to synchronize the particle's motion and the momentary direction 
of the magnetic field, the revolution frequency of the RF cavity wo has to be 
a multiple of the particle's revolution frequency w, which can be obtained 
simply from 


2 
IUS nese es Ct (1.9) 
T q ymo 


This means it has to be either the motion is such that y = 1, which corresponds 
to non-relativistic motion and hence severely limits the energy, or just enough 
acceleration is provided in each turn that the revolution frequency decreases 
to the next multiple of the RF frequency. So the revolution frequencies would 
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FIGURE 1.17: Illustration of the first microtron. (From S. P. Kapitza, 
The Microtron, Harwood Academic, London, 1978 [35]. Fig. 1.9, p. 14. With 
permission: © Taylor & Francis.) 


follow the pattern 
Wo Wo Wo wo 


— fs 1.1 
25 92-7 95 aD 


W = WO, 
This entails that the factor y follows the sequence y = ^o, 2y0, 30, 40; . . ., 
which requires Ay = 1 per turn. Since E = mc? = ymoc?, this means 
AE = moc?, and thus the necessary energy gain per turn must equal the rest 
mass energy of the particle under consideration. For electrons, this means 
AE = 511 keV and is thus possible; for protons, AE = 938 MeV and this is 
not easily possible within the confines of a conventional magnet. 

A very important further development of the concept of a microtron is 
based on the fact that if the orbits of the particles are far enough separated 
so that one can apply different magnetic fields for each orbit and can even 
change the shape of the orbit away from circular, then by careful choice of the 
orbit lengths, it is possible to maintain the synchronicity condition (1.10) 
while maintaining the freedom to have any amount of acceleration that is 
convenient. This is the basic idea behind CEBAF, the Continuous Electron 
Beam Accelerating Facility, at Thomas Jefferson National Accelerator Facility 
(TJNAF, Jefferson Lab, JLab), Newport News, Virginia, USA. 
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FIGURE 1.18: The principle of the cyclotron, with top view on the left 
and side view on the right. (From E. O. Lawrence, US Patent 1,948,384, 1932 
[40].) 


'The basic idea of the cyclotron is similar to that of the microtron, except 
that the RF cavity is used more efficiently by providing acceleration twice or 
even more times per turn, and the orbits roughly follow concentric circles. 
The concept of the cyclotron is shown schematically in Fig. 1.18 [40]. 

According to eq. (1.9), the revolution frequency is 


w=- B, (1.11) 
ymo 


and the momentary radius of the orbit is 


p 


= —. 1.12 
iP. (112) 


'This entails very similar restrictions regarding relativistic effects as in the case 
of the microtron; as before, any deviation from constancy of the magnetic 
field prevents continuous injection of the beam and hence leads to a non- 
continuous outgoing beam. But because the orbits are nearly concentric, it is 
possible to at least partly compensate the relativistic effects by increasing 
B radially in such a way that the revolution frequency in eq. (1.11) stays 
constant. This kind of cyclotrons is called the isochronous cyclotron. If it 
is necessary to accelerate different particles in the same machine, then that 
entails that the actual field profile has to be adjustable, which is usually 
achieved by having one or several trim coils. The superconducting K1200 
cyclotron at the National Superconducting Cyclotron Laboratory (NSCL) at 
Michigan State University, Michigan, USA, allows for such corrections of the 
profile of the magnetic field. 
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FIGURE 1.19: The first model of an FFAG, the Fixed-Field Alternating 
Gradient accelerator. (From L. W. Jones and K. M. Terwilliger, in E. Regen- 
streif, ed., Proc. CERN Symp. High Energy Accelerators and Pion Physics, 
CERN 56-25, 1956 [34]. Courtesy CERN.) 


If continuity of the beam is not of prime importance, it is possible to make 
the necessary relativistic corrections due to eq. (1.11) via a decrease of the 
RF frequency during the acceleration process, which is done in the case of the 
synchrocyclotron. This decrease obviously has to happen very quickly over 
the few hundred turns of the particles while staying within the accelerating 
structure, and thus the pulse frequency can still be rather high. 


A variant of the cyclotrons that were studied intensively in the 1950s was the 
fixed-field alternating gradient (FFAG) accelerator. Fig. 1.19 shows the 
drawing of the first of such an accelerator built. It combines the feature of the 
fixed magnetic field as in a cyclotron and the idea of alternating gradient 
focusing that became widely known in the early 1950s. Although FFAG did 
not flourish as a high energy accelerator, it has generated renewed interest in 
the past decade as a candidate to rapidly accelerate decaying particles such 
as muons and ion beams with large emittance and momentum spread. 


For any accelerator, the ultimate energy limitation comes from the 
strength of the magnetic field that is available as the unavoidable restric- 
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FIGURE 1.20: Sketch of the Bevatron, designed to achieve “Billions of eV 
Synchrotron,” at Lawrence Berkeley National Laboratory, California, USA. 
(From E. J. Lofgren, in E. Regenstreif, ed., Proc. CERN Symp. High Energy 
Accelerators and Pion Physics, CERN 56-25, 1956 [46]. Courtesy CERN.) 


tion 

P Z Bp = Xm. (1.13) 

q 
The range of available magnetic fields is rather limited; typical numbers are 
in the range of 1-2 T for normal conducting dipole magnets, and several 
times more for superconducting dipole magnets. The superconducting dipole 
magnets at the Large Hadron Collider (LHC) at the European Organization 
for Nuclear Research (CERN), near Geneva, in Switzerland and France, op- 
erate reliably at 8 T. (See Table 1.1.) Looking beyond the rather stringent 
requirements for particle accelerators regarding field quality over extended 
regions and temporal stability, as of 2013 the highest magnetic fields that 
can be achieved are about 100 T. In fact, the National High Magnetic Field 
Laboratory (NHMFL), having branches at Florida State University, Univer- 
sity of Florida and Los Alamos National Laboratory (LANL), USA, reached 
100.75 T at the Los Alamos branch in 2012. The Dresden High Magnetic 
Field Laboratory (Hochfeld-Magnetlabor Dresden, HLD) at the Helmholtz- 
Zentrum Dresden-Rossendorf, Germany, reached 91.4 T in 2011, a record at 
the time, and 94.2 T in 2012. 

So for practical purposes, the only way to achieve high energies is to increase 
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FIGURE 1.21: Layout of the Cooler Synchrotron (COSY) ring at the Insti- 
tute of Nuclear Physics (IKP) at Forschungszentrum Jiilich, Germany. (Cour- 
tesy Forschungszentrum Jülich GmbH.) 


the deflection radius p. This represents a significant practical limitation to 
continuous beam accelerators, in which B must be time independent and 
the size of the orbits increases in the acceleration process, since any region in 
which the beam may come has to be covered by magnetic fields. So for really 
high energies, the only realistic option is to have the particles follow the 
same orbit all the time by ramping the magnetic field during acceleration, 
and thus confine the region that has to be covered by the magnetic field. 


Of course this ongoing adjustment of the magnetic field during the accel- 
eration process according to eq. (1.13) to maintain constancy of p prevents 
continuous injection and hence continuous beams. Furthermore, since electric 
field strengths are comparatively more limited, the fields of the cavities have 
to be re-utilized many thousands of times, resulting in a rather stretched-out 
acceleration process, and thus a rather low repetition rate of beam pulses. 


All these thoughts lead to the concept of the synchrotron, in which the 
magnetic field strength is synchronized with the momentary energy or mo- 
mentum of the particle so as to maintain a constant location of the reference 
orbit. The first generation of synchrotrons uses inhomogeneous dipole magnet 
to bend and confine the beam transversely, which is essentially the same as in 
a betatron. The only difference is that the acceleration is achieved through 
RF cavities. Fig. 1.20 shows an example of such a machine. The main limit of 
this kind of synchrotron is that the transverse focusing force from the gradi- 
ent magnet is very weak, resulting in large beam pipes and magnets. For this 
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FIGURE 1.22: Layout of the Super-ACO light source storage ring at Labo- 
ratoire pour l'Utilisation du Rayonnement Electromagnétique, Orsay, France. 
(From M. P. Level, et. al, in Proc. PAC 1987, OSTI ID: 5125784, CONF- 
870302-Vol.1, 470, 1987 [44].) 


TABLE 1.1: — of hadron synchrotrons 


250 GeV 


reason, they are called weak focusing synchrotrons. Alternating gradient 
focusing offered orders of magnitude stronger focusing, much smaller beam 
size and much smaller magnets. Since the mid-1950s, alternating gradient 
synchrotron, also called strong focusing synchrotron, has replaced the 
weak focusing synchrotrons. Nowadays, almost all high energy accelerators 
are strong focusing synchrotrons. Fig. 1.21 shows one example, which is the 
layout of COSY, the COoler SYnchrotron at Forschungszentrum Jiilich [3]. 
Table 1.1 shows characteristic features of some of hadron synchrotrons. 
Shown are the Relativistic Heavy Ion Collider (RHIC), at Brookhaven Na- 
tional Laboratory (BNL), Upton, New York, USA, the Tevatron (1987-2011) 
hosted at Fermi National Accelerator Laboratory (Fermilab, FNAL), Illinois, 
USA, and the LHC with their approximate dimensions, the maximum ener- 
gies (per nucleon for ions) for which they are designed, and particles to be 
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FIGURE 1.23: Layout of the Advanced Light Source (ALS) at Lawrence 
Berkeley National Laboratory, California, USA. (From Document Control 
Center of Lawrence Berkeley National Laboratory, Print number: 22Q2593, 
1989. Courtesy Lawrence Berkeley National Laboratory.) 


accelerated. 

The storage ring is not an accelerator in the traditional sense, since it 
holds the energy of the stored beam constant; however, it does not necessarily 
mean that RF cavities are not needed. In fact, due to synchrotron radiation, 
all the electron and the high energy proton storage rings use RF cavities to 
maintain the energy of the beam. Naturally, a synchrotron often can play the 
role of a storage ring as well. 

The time that the particles stay in the storage rings ranges from minutes 
to days. In the case of the Tevatron, where the circumference is 6.28 km, the 
time for one operation while having collisions for high energy experiments is 
about 8 hours, which is 28800 sec. So the particles circulate through the ring 


_ 3 x 105m/ sec -28800 sec 
i 6.28 x 109m 


As a comparison, the operation duration without collision is more than 100 
hours at the Tevatron. The LHC has similar numbers. Thus, even more so 
than in the case of the synchrotron, one of the main design problems and 
physically perhaps the greatest challenge is to try to ensure that particles 
actually stay contained over this large number of turns. Because the motion 
is nonlinear, this immediately leads to questions of nonlinear dynamics with 
all their complicated and interesting aspects. 

One of the applications of storage rings is the collider, where counter 
rotating beams are brought to collision at various points around the ring. At 


z 10? turns. 
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very high energies, colliders have a significant energy advantage over fixed 
target machines because a very large fraction of the beams' energies can be 
converted to reaction energy. As a detailed study of the relativistic dynamics 
shows, this is not at all the case for fixed target cases; in fact, conservation 
of energy and momentum severely limits the energy that can be set free. 
Large scaled circular colliders are those listed in Table 1.1, and the tunnel 
with circumference 27 km, hosting the LHC currently, was earlier used for 
the Large Electron-Positron Collider (LEP) (e*,e^; ~ 100 GeV, 1989-2000). 
Besides the energy advantage, storage rings also have the disadvantage of the 
slow ramping times typical for synchrotrons; however, once the beam is stored, 
it is essentially continuous again. 

But also for situations that require the beam to hit a fixed target, storage 
rings often offer an advantage over the use of synchrotrons by themselves, 
because it is often possible to extract the beam much more slowly than in the 
case of the synchrotron, resulting in a more easily manageable duty cycle and 
reducing the problem of overflowing the electronics in the detectors. In this 
method of ultra-slow extraction, the nonlinear dynamics of the device is 
adjusted very carefully and gently, as over time a larger and larger part of the 
originally stored emittance becomes unstable. If it is possible to control the 
location around the ring where the spilling occurs, then the spilled particles 
can be directed toward the fixed target as needed. One storage ring where 
this approach is utilized is COSY, the cooler synchrotron and storage ring, at 
Forschungszentrum Jülich, Germany, shown in Fig. 1.21. 

Another application of the storage ring that has become one of the most 
productive tools for scientific research is the synchrotron light source. 
Although not as majestic as the giant high energy colliders, there are many 
synchrotron light sources throughout the world, and each facility hosts many 
users from almost all disciplines of the sciences. In the light source, the probe 
for the experiments is the light (from far infrared to hard X-ray) radiated 
by the electrons when the orbits are bent in the ring, which is generated 
through the process of synchrotron radiation. Figs. 1.22 [44] and 1.23 
[42] show a couple of synchrotron light sources. As the electron mass is so 
small, in principle, any bending magnet can be used to produce light due 
to synchrotron radiation. But in addition, in the straight sections of a light 
source ring, often wigglers and undulators, which consist of alternating 
short bending magnets, are placed to produce more intense and coherent light. 
In such a way, each light source ring can hold tens of light beamlines, much 
more than the number of interacting locations that a high energy physics or 
nuclear physics collider can have for the collider experiments. 


Chapter 2 


Linear Beam Optics 


In the discussion of the basic physical principles of the various types of ac- 
celerators, we casually neglected the fact that it is necessary to take care of 
more than one particle. In fact, all the above accelerators have to be able 
to simultaneously deal with an ensemble of particles with similar phase space 
coordinates, which is what the sources deliver, and hence with a beam. As 
outlined above, a detailed understanding of the motion of the beam requires 
the study of the motion of the reference particle as well as the motion of 
the relative coordinates. 


In the case of accelerators, our demands on the relative motion are mostly 
that the beam does not become unreasonably large, and hence that the motion 
is somehow bounded within a suitable volume of phase space. While this 
appears to be a modest wish for long single pass accelerators, and more so for 
repetitive systems, this problem actually turns out to be rather nontrivial. 


For other types of systems, more specific requirements have to be made for 
the beam. For example, to maximize the number of collisions at an inter- 
action region of a collider, it is important to “squeeze” together the spatial 
coordinates of the beam, which under conservation of phase space volume 
then requires the momentum coordinates becoming large. Devices like parti- 
cle spectrographs or electron microscopes have different and often even more 
involved requirements. 


In all of these cases, it is important to study the relative motion carefully. 
As a first step, the motion is linearized, and for higher precision, the nonlinear 
effects of the motion have to be studied. Because the volume in phase space 
occupied by a beam is small, these nonlinear effects are often treated in a 
perturbative way, in which the first order corresponds to linear motion, 
and nonlinear motion appears as higher order (see Table 2.1). 


TABLE 2.1: Classification of effect 


zeroth order motion of reference particle 
first order linear motion 
second+higher orders nonlinear motion 
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Reference Orbit 


FIGURE 2.1: Reference orbit, arc length s along it and local coordinates. 


2.1 Coordinates and Maps 


Usually when studying dynamics, the time £ plays the role of the indepen- 
dent variable, and we study the motion of positions i and velocities v or mo- 
menta p as coordinates. Using the Lagrange mechanism, it is easy to transfer 
to new coordinates, in particular the coordinates that describe the relative 
dynamics around the reference orbit. Furthermore, instead of using t, we 
usually use the arc length s along the reference orbit as an independent 
variable. Fig. 2.1 illustrates the concept. 


For the understanding of the motion in relative coordinates, let us assume 
we have studied and understood the motion of the reference orbit. In case 
there is no field at all, this reference orbit will merely follow a straight line. 
Furthermore, there are many devices used in accelerators that have fields, but 
along one given straight line, all the fields vanish, and the device is lined up 
in such a way that the reference particle follows this line. Another important 
device uses magnetic fields, and along the reference orbit one tries to hold 
the magnetic fields constant, in which case the reference orbit is circular, at 
least within the element. In all other cases, it is usually necessary to describe 
the reference orbit by numerically integrating the equations of motion. 


We assume the position and momenta of the reference particle 7yer($), prer(s) 
are known. Here the momentum p is the dynamical momentum as in eq. 
(1.4). As a technical detail, let us also assume that for all points s, we have 
Dret(S) Y EzLab, i.e., the motion is never pointing vertically straight (which 
for most real accelerators is no limitation whatsoever). Let furthermore riube 
be smaller than the minimum radius of curvature that the reference orbit 
experiences in the section of the device that we want to study. We now 
consider a “flexible tube" of radius rjype centered around the reference orbit, 
and restrict the particles that we want to describe to only those within the 
tube, as Fig. 2.2 illustrates the situation. Again, for practical devices this 
represents hardly a limitation; for example, in the LHC (see Table 1.1), the 
"tube" would be more than 2 km wide, much larger than the region required 
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FIGURE 2.2: Motion of particles inside the tube with radius riype around 
the reference orbit. 


by the beam particles. 

For any particle within the tube, there is now a closest point on the 
reference orbit; because only particles within the tube are allowed, this point 
is indeed unique. Let s be the arc length at this point, and r;e:(s) the position 
of the reference particle on the reference orbit. Then the relative coordinates 
of the point 7 are obviously F — rier(s). 

Let now €, be a unit vector in the direction of Pref. Consider now the plane 
perpendicular to é,. Of all the unit vectors in this plane, let y be the one with 
the largest vertically upward component; because in our setup Pref and hence 
€, are not allowed to go vertically straight, this vector is well defined. Finally 
choose a third vector & as €; = Ey x €,. Because €, has a maximum vertically 
upward component, & has a vanishing vertical component and hence lies in 
the horizontal plane. 

Denote now by “x” the component of 7 — r;er(s) in the direction of €z, and 
by “y” the component of 7 — Trets) in the direction of é,. Similarly, define p; 
and p, to be the momentum components of p — Pref in the directions €; and 
Cy. 
Using {x, Px, Y, Py}, the motion in the transversal plane, defined by & and 
Ey, can be described, and it is called the transversal dynamics. However, 
considering how a beam is formed as we have seen in Chapter 1, we have to 
consider that the energy of a particle E in the beam can be different from 
that of the reference particle Eyer, even if it is only slightly so. The energy 
difference of the particles as well as the geometry of the orbits also results in 
the difference of the travel time t of the particles, called the time-of-flight. 
Thus, the energy and the time-of-flight have to be considered when studying 
the motion of a beam, and it is called the longitudinal dynamics. 

As we will see later, the transversal motion and the longitudinal motion are 
in general coupled, except for special cases. Altogether, we will describe the 
motion of the beam in six coordinates [z, pz, y, py, E, t). The actual choice of 
coordinate quantities requires a careful consideration, as it eventually deter- 
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mines how widely the resulting derivations are compatible with general con- 
cepts in physics and mathematics. Below, the quantities with the subscript 0 
are meant to indicate the reference particle. 

Before launching the motion, we denote the energy deviation of a particular 
particle of interest by ô by defining 


where K is the initial kinetic energy of the particle under consideration while 
Ko is that of the reference particle. Finally, we introduce a space-like variable 
l 

l=k (t = to) 


being the deviation of the time-of-flight t from that of the reference particle, 
multiplied by a constant « that has the dimension of velocity. Specifically, 


l (2.1) 


using the absolute value of the velocity of the reference particle vo and the 
associated yo, which can be expressed as 


1 _ V(Poc)* + (me)? _ Eo 


Yo — —— M 
vV1-w/c "ie me?’ 


by referring to eq. (1.6). The specific form of «, especially the fractional factor 
involving yo, is important for generating what turns out to be a canonical pair 
of coordinates (l, ô); the details go beyond the scope of this book, and we refer 
to [5] for details. 

Then we form the vector Z of particle optical coordinates as 


x 
a = ps/po 

7 y 

Z- 2.2 
b = py/po : (22) 
l = &(t — to) 
ô —(K — Ko) /Ko 


where po is some previously chosen scaling momentum; a natural choice is 
to select the momentum of the reference particle at the beginning. Likewise, 
Ko is a previously chosen scaling energy, for example the kinetic energy of 
the reference particle, and similarly, & is a scaling quantity introduced in eq. 
(2.1). 

Note that due to the definition of Z , the reference particle itself corresponds 
to Z = 0, and hence the vector Z does indeed describe the relative motion. 
In a seemingly simple way, most of the problems of beam physics now revolve 
around the question as to how Z evolves as a function of s. 
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In light of this, the entire action of a beam physics device can now be 
expressed by how it manipulates the coordinates in Z.In fact, usually a set of 
initial conditions Žo at position so uniquely determines the future evolution 
and hence Z at any later position s. While a common notion, mathematically 
this determinism of classical mechanics rests on some subtle assumptions 
about the details of the fields that are allowed in the motion; but further 
details are beyond this book. 

Assuming that indeed Zo at so uniquely determines the future evolution, 
we can define a function relating the initial conditions at so to the conditions 
at s via 


Z (s) = M (s, so) (Z (9). 


The function M (so, s), which formally summarizes the entire action of the sys- 
tem, is of great importance for the description and analysis of beam physics 
systems. It is often called the transfer function, the transfer map, or 
simply the map of the system. Note that the transfer maps satisfy the rela- 
tionship 


M (82, 81) o M (81, so) = M (82, so), (2.3) 


which merely says that transfer maps of systems can be built up from the 
transfer maps of the pieces. 
Since M describes the motion in relative coordinates, we always have 


M(0) = 0. 


Furthermore, since by the very definition of a beam, the coordinates of Z are 
“small,” M is usually only weakly nonlinear. Because of this, its deter- 
mination and analysis is very amenable to perturbative techniques. The 
first step in this process is to consider only the linearization M of M, the 
so-called linear map. Let M = M-—M be the remaining purely nonlinear 
part, so that we have 


M 2M +N. 


'The linear map M is simultaneously the most important and the easiest to 
study. The treatment of the nonlinear part M is much more complicated, and 
only later in the book will we address a small part of the problems associated 
with its treatment. More details can be found, for example, in [5]. 

In the following section, we will make a short excursion to a field that at 
first glance appears disconnected from beam physics, namely the field of glass 
optics. However, a closer look shows that glass optics, which has existed long 
before the name beam physics was introduced, certainly belongs to this field: 
the ensembles of light particles or rays typically associated with questions of 
glass optics form a beam not only in the conventional meaning of the word, 
but also under the more formal definition. 
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2.2 Glass Optics 


As one may recall from a basic course in optics, a distinction is made be- 
tween so-called *Gaussian optics," which indeed turns out to just mean 
linear motion, and “aberrations” that describe nonlinear effects. Optics has 
developed its very own jargons and techniques, some of which are connected 
to complicated geometric ideas, and in our opinion it is historically unfortu- 
nate that optics has not been treated with the methods of the transfer map. 
We shall remedy this situation here by simultaneously providing a short intro- 
duction on Gaussian optics in an appealing and unified way, and also develop 
our skills in dealing with linear maps. 

For simplicity, let us restrict ourselves to systems that are rotationally sym- 
metric, like most glass optical systems; it will be quite clear as we go what has 
to be done to treat non-rotationally symmetric systems. In this rotationally 
symmetric case, two variables are enough to study the motion; we here choose 
them as the position x and the slope a of a ray. The transfer map of an optical 
system then expresses how (x,a) behave as they transfer a system, and we 


have 
ag a1 


In fact, if we restrict ourselves to linear motion, then this can be expressed in 
terms of a transfer matrix 


Note that the notation for the matrix elements is such that the quantity 
before the vertical line “|” describes the row, and that after the vertical 
line describes the column. We remind again that knowing matrices of pieces 
allows the computation of matrices of more complicated systems, which is 
here achieved by mere matrix multiplication. Indeed, if Ma through Mi, are 
the matrices for the subsystems, then because of the associativity of matrix 
multiplication, we obtain for the ray after the last subsystem: 


(2) (2))-) =e) (2) 
An+1 ay ai 

So we have shown that the matrix of a combined system equals to product 
of matrices of subsystems. Since especially on computers it is very simple 
to multiply matrices, this is the method of choice for the basic design of 


optical systems. In the following, we hence derive the forms of the matrices 
of common optical elements. 
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FIGURE 2.3: A ray passing through a drift. 


2.2.1 The Drift 


The simplest part of glass optical elements is a region which does not contain 
any material, the drift. The final position and slope x2 and a» after a drift 
of length | can be connected very simply to the initial values zı and aj, as 
shown in Fig. 2.3 


z2 = 11-4 a4-l, ag = aj. 


This obviously can be written in a matrix form as 
X2 = 1 l X1 
a2 z 0 1 ay ` 
For the later discussion it is important to note that the matrix (4 1 ) depends 


only on the characteristic properties of the element, which here is the length 
l. On the other hand, the vector (zx1,a1) depends only on the parameters of 
the ray. Altogether, a drift performs a linear transformation in x, a space. 
Note that the determinant of the drift matrix is unity. 

As a small exercise, let us now consider a combination of two drifts of lengths 
lı and l2. For the value of the coordinates (13,83) after the combination of 
the two drifts, we have 


Eso sona d OU 


Here the necessary composition of maps just reduces to a common multi- 
plication of transfer matrices. And the result is not surprising, the effect 
of two subsequent drifts is just the same as that of a drift of the combined 
length. 


2.2.2 "The Thin Lens 


Besides empty space, glass optical devices contain lenses that change the 
direction of the light ray. Here we are primarily interested in the thin lens, a 
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lens 


FIGURE 2.4: A bundle of parallel rays passing through a focusing lens. 


somewhat idealized device without any length, which is characterized by the 
following facts that are also illustrated in Fig. 2.4. 


1. Positions are not changed, but directions are changed. 


2. Any bundle of parallel light is unified in one point a distance f after the 
lens. 


3. A ray lighting the center of the lens goes straight through. 


The quantity f that describes the lens is called the focal length. Let us 
now consider a ray passing through the lens. From Fig. 2.4 we find 
T2 = T1, p—f:a, zı +a2: f =p, 


from which we infer 2: 
1 
T2 = T1, ag = —— + 4}. 


f 


This relationship can be written in a matrix form as 
T2 EN 1 0 Tı 
(a) = (ars 3) Ga): ea 
As in the case of the drift, the matrix (ay 2) depends only on the focal 
length f, the characteristic property of the lens, whereas the vector (z1,a1) 


depends on the ray. Note that the determinant of the matrix Cay f) is 


unity. 

The simple thin lens we have discussed here, the so-called focusing Gaussian 
lens, represents quite an approximation for several reasons. First, any real lens 
performs a refraction at two different surfaces, so positions do change as one 
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FIGURE 2.5: A bundle of parallel rays passing through a defocusing lens. 


goes through the lens. Furthermore, for most lenses it is not really true that 
parallel rays all meet at a point a distance f behind the lens. This is connected 
to the fact that lenses are usually ground with spherical surfaces because 
anything else is technically difficult. Furthermore, the glass has dispersion, 
so different colors are affected differently. We note however that Snell’s law 
still allows us to determine the true transfer map of a thick, spherical lens 
in a rather straightforward way. It is important to note, however, that this 
transfer map will no longer be linear. 

Quite interesting is the combination of two glass lenses, which can ap- 
parently be described by multiplying their matrices. Note that, always, the 
matrix of the first element is on the right. We obtain 


m tI en "A 2 


So the combination of two lenses provides the same effect as one lens with 
focus length f, where 1/f = 1/fı + 1/fa. This is of course a famous law of 
optics, the derivation of which is all but trivial in the matrix context. Indeed 
the efficiency of the matrix approach becomes clear when observing how to 
prove this law using the standard geometric method of optics textbooks. 

In a similar way as the focusing thin lens we can also treat the defocusing 
thin lens. In this case, the basic properties can be found as illustrated in Fig. 
2.5. 


1. Positions are not changed, but directions are changed. 
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2. Any bundle of parallel light exits the lens in such a way that it appears 
to come from a point a distance f in front of the lens. 


3. A ray lighting the center of the lens goes straight through. 


In a similar way as before, we can use basic geometry to determine the 
action of the lens. From Fig. 2.5, we find 


T2 = T1, p=-f: a, p = T2 — fag. 


Similar to before, we obtain a» = z1/f + a1, which is in a matrix form 


X2 u 1 0 X1 

ag E 1 / f 1 a1 i 
This is essentially the same matrix as before, except that now the sign of 
the matrix element (a|x) has changed. Indeed, using the standard convention 


to count defocusing lenses with a negative focal length, the matrix has even 
exactly the same form as before. 


2.2.3 The Thin Mirror 


Besides lenses, mirrors are probably the second most important optical 
device, and there are also focusing and defocusing mirrors. Different from 
the lens, the reference orbit flips direction when hitting the mirror. A thin 
focusing mirror is defined by what it does to an ensemble of parallel light via 
three conditions, illustrated in Fig. 2.6. 


1. Positions are not changed, but directions are changed. 


2. Any bundle of parallel light that is reflected by the mirror will meet in 
a point a distance f in front of the mirror. 


3. A ray hitting the center of the mirror is reflected such that its outgoing 
angle equals its incoming angle. 


A similar argument as in the case of the focusing lens shows that the transfer 
matrix of the focusing mirror is 


a= ( Y, D 


There is also a defocusing mirror, defined by three conditions: 


1. Positions are not changed, but directions are changed. 


2. Any bundle of parallel light that is reflected by the mirror seems to 
emerge from a point a distance f behind the mirror. 
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mirror 


FIGURE 2.6: A bundle of parallel rays is reflected by the focusing mirror. 


3. A ray hitting the center of the mirror is reflected such that its outgoing 
angle equals its incoming angle. 


A similar argument shows that also in this case, we have the transfer matrix 


^ 1 0 
E 
where the convention to count the focal length f of a defocusing element 
negative is used. 

So apparently mathematically, lenses and mirrors behave the same, aside 
from the fact that they reverse the reference orbit. The choice of which to use 
in practice depends on a variety of practical factors. For situations requiring 
only small apertures like in most camera lenses, glass lenses are easily made, 
and have an advantage because of the straight beam path. For situations 
requiring large apertures, like in big telescopes, mirrors are the primary choice 
because it is much easier to manufacture and support large mirrors than large 
lenses. It is also easier to produce non-spherical shapes for mirrors than for 
lenses. Finally, mirrors have the additional advantage that they treat light of 
different colors equally; they do not show the dispersion commonly observed 
in glass lenses. 


2.2.4  Liouville's Theorem for Glass Optics 


As a direct consequence of the matrix notation for glass optics introduced 
above, for any combination of lenses, drifts and mirrors, we can prove a special 
case of Liouville's theorem: The volume of phase space occupied by the beam 
is conserved. 

Indeed, let us assume that we have an optical system consisting of n ele- 
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FIGURE 2.7: Liouville's theorem. The volume of phase space occupied by 
the beam is conserved. 


ments with matrices Mi. 'Then we have 


(en) e C) )) = Ge) (2). 


The determinants of each of the matrices M; are just unity, as they are all 
either drifts, lenses or mirrors, so the determinant of the product is unity. 
Under linear transformations, volumes in space transform with the size of the 
determinant, thus the volume is indeed conserved. Fig. 2.7 illustrates this 
situation. 

An interesting and remarkable consequence of Liouville's theorem is the 
famous recurrence theorem of Poincaré. Let us assume we have some 
motion in n-dimensional phase space and also that we know that the motion 
is bounded in all phase space variables. Let us further assume that the mo- 
tion obeys Liouville's theorem, which as we shall see later is the case for all 
Hamiltonian systems, and let the motion be deterministic. Then Poincaré's 
recurrence theorem states that for any given &, the system after sufficient time 
comes back to its original state within a tolerance of at most e. 

Before we sketch the proof of Poincaré's theorem, let us illustrate some of 
its consequences. Consider for example a box with classical gas particles that 
are initially all located in one side of the box and kept there by a wall as 
shown in Fig. 2.8. After the wall is removed, the gas particles will distribute 
in the box evenly, as we expect from classical statistical mechanics, increasing 
their entropy. But their phase space is bounded, as the particles cannot leave 
the box, and each particle's momentum is limited by the total heat energy 
contained in the box. 

So as time progresses, according to Poincaré, they will at one time in the 
future just recollect on one side of the box, and by re-inserting the wall, 
they will be caught again on one side, in crass contradiction to the entropy 
principle. 

'There are many other examples. If we have a particle beam in an accelerator 
that we know is stable, it will eventually come back as close as we want in 
phase space, which is an effect that is actually observed somewhat routinely in 
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FIGURE 2.8: Poincaré’s recurrence theorem. After sufficient time, the 
system returns close to its original state. 


tracking pictures. Even for our daily life, there are important consequences. 
If the universe is Hamiltonian and it does not expand indefinitely, then up to 
minute details, history will keep repeating itself. So we will all be born again, 
and we will all make the same mistakes all over, but since now we cannot 
remember anything about our past life, also next time we will not remember 
our current life. 


Now let us sketch the proof of the recurrence theorem. Let an & be given, 
and consider an &-ball with volume V; in phase space. Consider its motion by 
regular time steps At. Since the total available phase space volume is finite, 
say Vp, after at most V,/V- time steps, the image of the ball must reach a 
part of phase space it has touched before, i.e., it must overlap a previous 
image of the ball. Let us assume this happens after N steps and the previous 
image is that after n steps, with n < N. But if the images after n steps In 
and after N steps Iw overlap, so must the images after (n — 1) and (N — 1) 
steps, respectively. And continuing backwards, so must the images after 0 and 
(N — n) steps; hence, after (N — n) steps, we touch the original -ball again. 


2.3 Special Optical Systems 


In this section we want to apply the matrix techniques to the study of 
certain special categories of systems. In particular, we associate certain fun- 
damental properties of systems with properties of the matrix. We begin with 
the imaging systems. 
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FIGURE 2.9: Sketch of an imaging system. 


2.3.1 Imaging (Point-to-Point, e e) Systems 


Imaging systems or point-to-point systems are perhaps the most important 
systems in optics, and they deserve some special attention. Suppose we study 
the action of a slide projector. At one end of the projector, light is sent 
through the slide. Suppose the slide shows a man wearing a gold earring. 
The image of this man is to appear on the screen, and the gold earring is to 
appear at one particular location. This requires that all light passing through 
the golden spot on the slide and emanating in various directions has to be 
re-united at one spot on the screen, as shown in Fig. 2.9. 

This means that the final position of a ray is independent of its initial angle 
and it only depends on the initial position. In terms of transfer matrices 


this means that the element (r|a) has to vanish: 
(xja) = 0. 


Obviously the element (|x) also has an important interpretation: it is the 
magnification of the system. 


(x|r): magnification. 


Besides the case of the slide projector, many other devices use imaging. 
They include the camera, the overhead projector, the eye, the photographic 
microscope, the electron microscope, as well as particle spectrographs. We 
will discuss in detail some such devices in Chapter 7. 

It is worthwhile to study how imaging systems can be made. First, we 
observe that a drift is imaging if and only if | = 0, while it is a rather boring 
choice. A single lens is also always imaging as long as there are no drifts 
before and after, but that is another boring choice. The first interesting 
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imaging system is the DLD (drift-lens-drift) system, consisting of a drift, a 
lens and another drift. The transfer matrix of the DLD system is given by 


Er ee dpi 


| (i-h/f h+bh-hb/f 
BUS 1- h/f l 


If such a system is supposed to be imaging, we have to satisfy (a|a) = lı + 
l2 — lıl2/ f = 0, which is equivalent to 


1 " 1 1 
hd f 
'This is another important result of conventional optics, which here is obtained 


in an almost trivial way. If the DLD system is made to be imaging, the 
magnification is given by 


This principle is used in several different devices. In the slide projector, 
lj is very small and lə is very large, thus it provides a large magnification. 
Probably the most important imaging system is the eye. Here the situation 
is just the opposite. lı is large and lə is small, allowing for large things to be 
mapped on the small retina of the eye. 

It is interesting to study the combination of two imaging systems: 


prs m (an e) = iae PE ata} 


As is to be expected, the total system is again imaging, and the magnifica- 
tion is (z|z)o(z|x)1, just the product of the individual magnifications. 


2.3.2 Parallel—to—Point (|| e) Systems 


As we saw above, the human eye observing a nearby object is one of the 
prime examples of an imaging system. But what happens if the eye looks at 
things farther and farther away, in particular at the stars, a pastime of the 
human race and scientists for eternity? The length of the first drift |j becomes 
larger and larger, and for all practical purposes the light coming from one star 
reaches the eye as a parallel bundle. So what the eye is to interpret now is 
the angle under which the light comes in, and hence the position on the retina 
should depend only on the initial angle at which the light strikes the eye, but 
not on the initial position. 
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FIGURE 2.10: Sketch of a parallel-to-point system. 


FIGURE 2.11: Sketch of a point-to-parallel system. 


This is an example of parallel-to-point systems as illustrated in Fig. 2.10. 
A parallel-to-point system requires that 


(x|v) = 0. 
If we look at the eye as a DLD system, this requires 


(z|r) =1—-h/f —0 la — f, 


while lı is arbitrary. Thus the retina has to be exactly at the focal length; 
almost as important is that the distance to the object is arbitrary since we 
cannot change our distance to the stars significantly. Another important 
parallel-to-point system is the photographic camera. 


2.3.3 Point—to—Parallel (e |) Systems 


Another important class of systems is the point-to-parallel systems. In 
these systems, the final slope depends only on the initial position, but not on 
the initial slope as illustrated in Fig. 2.11. So we have 


(a|a) — 0. 


Examples include the flashlight, the microscope, and laser and particle beam 
transports over long distances such as those considered for the SDI Transport 
(Strategic Defense Initiative) considered by the US government in the 1980s. 

As an example, let us try to achieve a point-to-parallel system with a DLD 
combination. We obtain 


(ala) - 1— h/f —0 h=f, 
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FIGURE 2.12: Sketch of a parallel-to-parallel system. 


as we may have expected. Note that there is no condition on l5. 

From the transfer matrices, it follows rather directly that the combination 
of a point—to-parallel and a parallel-to-point system forms a point-to-point 
system. 


Las ak tae), n JS Latet DE aol tds 


Using the relaxed eye as the parallel-to-point system, we can thus build a 
microscope by placing a suitable point-to-parallel system in front of the eye. 
It is interesting to see how the lengths in a point-to-parallel system have to 
be chosen; by requiring (a|a) = 0, we obtain lı = f, while lg is arbitrary. The 
first part is as expected; the latter part is fairly important for the operation of 
a microscope because it allows the eye to move with respect to the microscope. 


2.3.4  Parallel-to-Parallel (|| ||) Systems 


The final important system is the parallel-to-parallel system illustrated in 
Fig. 2.12. By placing it between the eye and the stars, a magnification of 
angles can be achieved. This is the principle of the telescope. 

'The system has to be such that the final slope depends on the initial slope, 
but not on the initial position, which requires 


(ajx) = 0. 
The magnification is given by (ala). 
(ala): magnification. 
If we try to achieve this with a DLD system, then we have to satisfy (a|x) = 
—1/f to be 0, which is impossible. This entails that a telescope has to contain 


at least two lenses. 
So let us consider an LDL (lens-drift-lens) system. 


MS eT je en 1)= Cu s U fifa 273! 
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And we have to satisfy 


PM ONE M 
fo h fife 


which is a well-known condition for Newtonian or Galilean telescopes. The 
magnification of the telescope is given by 
l 1 fı 

aja) = 1— — = 1 = 1 l 2) — 

(ala) =1— zc 1- zi f) 
Thus, it requires fı >> fo to obtain large magnification. Since there is a limit 
on how short fə can be, it is thus necessary to make fı large, which entails 
what the rather large size telescopes usually have. 


(a|z) = — =0 = l= fit fo; 


2.3.5 Combination Systems 


Often the question arises to what extent it is possible to simultaneously 
satisfy the requirements for the above systems. To some extent this is possible, 
but the fact that the determinant of the total system has to be unity due to 
Liouville's theorem for glass optics imposes some restrictions. 

A closer look shows that 


1. e e and || || is possible: (z|a) = (a|z) = 0. 


2. || e and e || is possible: (x|z) = (ala) = 0. 


All other cases are impossible because they would require a zero deter- 
minant. 


Another important question is what happens when two systems satisfying 
certain properties are combined into one system; for example, we already saw 
in eq. (2.5) that two point-to-point systems placed behind each other again 
produce a point-to-point system. A more detailed analysis shows that of 
the sixteen cases describing combinations of two systems, eight cases lead to 
another special system 


eecee—cec, |I-ll-ll:; 
ese wq ees 
elt lees, le ref=l ls 
€ | Fleiss. l|e+ee=]||o. 


The entries in the table above are easy to memorize because it contains just 
those combinations for which the second symbol of the first system equals the 
first symbol of the second system, and the final result is obtained by dropping 
the two identical symbols. So in compact notation, we have: 


If A,B,C € {e, ||}, then AB+BC=AC. 


Chapter 3 


Fields, Potentials and Equations of 
Motion 


For the study of transfer maps of particle optical systems, first it is necessary 
to undertake a classification of the possible fields that can occur. All fields 
are governed by Maxwell’s equations, which in SI units have the form 


" = >» 9D 
div B =0 lH =j + 
iv , cur gb DE” 
= 4 B 
div D =p, curlk = m (3.1) 


In the case of particle optics, we are mostly interested in cases in which 
there are no sources of the fields in the region where the beam is located, so 
in this region we have p — 0 and j — 0. Of course any beam that is present 
would represent a p and à j, but these effects are usually considered separately. 

In the following, we want to restrict ourselves to time independent sit- 
uations, and neglect the treatment of elements with quickly varying fields 
including cavities. This limitation in very good approximation also includes 
slowly time varying fields like the magnetic fields that are increased during 
the ramping of a synchrotron. 

So, Maxwell's equations simplify to 


div B = 0, cur H- 0, 
divD=0, curl Ë =0, (3.2) 
where 3 
B= bof, D= EoE. 
Because of the vanishing curl, we infer that E and B have scalar potentials 
Vg and Vg such that 


EÉ-2-VVg, B--VVg. 


Note that here even the magnetic field is described by a scalar potential, 
and not by the vector potential A that always exists. From the first and third 
equations of (3.2), we infer that both scalar potentials Vg and Vg satisfy 
Laplace's equation, and we thus have 


AVg 20, AVg-0. 
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In order to study the solutions of Laplace's equations for the electric and 
magnetic scalar potentials, we will proceed for two special cases, each of which 
will be treated in a coordinate system most suitable for the problem. 


3.1 Fields with Straight Reference Orbit 


'The first major case of systems is those that have a straight reference orbit. 
In this case, there is no need to distinguish between particle optical coordinates 
and Cartesian coordinates, and in particular there is no need to transform 
Laplace's equation to a new set of coordinates. Many elements with a straight 
reference orbit possess a certain rotational symmetry around the axis of 
the reference orbit, and it is most advantageous to describe the potential in 
cylindrical coordinates with a longitudinal z-axis that coincides with the 
reference orbit. 


3.1.1 Expansion in Cylindrical Coordinates 


We first begin by expanding the r and ¢ components of the potential in Tay- 
lor and Fourier series, respectively. However, the dependence on the cylindri- 


€ un 


cal “z” coordinate, which here coincides with the particle optical coordinate 
5, is not expanded. So we have 


V =V(r,¢,8) = M M Mi (s)cos(ló + 04,1) r*. (3.3) 


k=0 1=0 
In cylindrical coordinates, the Laplacian has the form 


ovy iov av 
"Or r2? 0g? Os?’ 


10 / OV 18V ƏV 
= rT — + — —— —— cL 

Or r? 0¢? 

We insert the Fourier-Taylor expansion of the potential (3.3) into each term 
of the Laplacian. 


ƏV 
rgy T" 2 2 Mia(s) cos (lo + 64,1) kr 
k=1 1=0 
= Mpls) cos (lo + 0.1) kr”, 
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then the first term is 


18 / ƏV | Pe 
r Or CZ) E =) Mails) cos (lØ + 01.1) k?r^-! 


where we let the sum start at k = 0 in the last step, since there is no contri- 
bution anyway because of the factor k?. The second term is 


EQ pu Em 
es ` 25 Mx,1(s) cos (l + 0,1) Pr® 


2572 2 
r? 06 T eoo 
=-->) Mpg (s) cos (lb + 04,1) Pr*-?. 
k=0 1=0 


The third term is 


- M Ms ) cos (lo + 04,1) r^ 


k=0 1=0 
= 5 5 My_2,1(8) cos (ld + Ox—2,1) pe 
k=2 1-0 
zy Y WE 24 (5) cos (lØ + 0,21) r^^, 
k=0 1-0 


where we let the sum start at k — 2 in the second step, and, further, in the last 
step, we used the convention that the coefficient Mp (s) vanish for negative 
indices. Recognizing that all the terms have the common summations and 
the factor r*~?, we obtain the Laplacian for the Fourier-Taylor expansion of 
the potential (3.3) as 


ns > [Mr (8) cos (16--05.,) (k? — 1?) + Mg S. (s) cos (16+ Ox-2,1)| p 
k,l=0 


To satisfy Laplace’s equation, we obtain a set of conditions for k,l > 0, 
Mxy(s) cos (ld + 0x.) (k? — I?) + My (s) cos (ld + Ox 21) = 0, 


where the second term vanishes for k = 0, 1 because of the negative indices 
for Mk,- 

We begin the analysis of Laplace’s equation by studying the case k = 0, 
where only the first term matters. Apparently Mo, and 60,9 can be chosen 
freely because k? — |? = 0 for k = l = 0. For l > 1, we infer Mo, = 0. 
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By induction over k, we now show that M;,; = 0 for all cases where k < l. 
Apparently the statement is true for k — 0 as we just showed. Now let us 
assume that the statement is true up to k— 1. If k < l, also k—2 < l, and thus 
Mj. 4,(s) = 0. Since k? — I? Z 0 and cos (ló + 0,1) # 0 for some ¢ because 
l £0, this requires 

Mxy.(s) =0 for k<l. 


Thus the infinite matrix Mrz; is strictly lower triangular. 
We now study the situation for different values of l. We first notice that for 
all l, the choices of 


Mii(s) and 0j; are free 


because Mj' (s) = 0 by the previous observation, and k? — 1? = 0 for k — I. 
Next we observe that the value Mj411(s) must vanish, because k? — I? Æ 0, 
but Mf ,,(s) = 0 because of the lower triangularity. Recursively we even 
obtain that 


Mi4i.(s), Mizsi(s),... vanish. 


On the other hand, for k = l+ 2, we obtain that 0142, = 61, and Mj+2,1(s) 
is uniquely specified by Mi(s). Applying recursion, we see that in general 


Ori = 42,1 = O44 — s 
2n 
ME (s) 


Mi+2n,i(8) = Wa (P - tem) 


(3.4) 


Let us now proceed with the physical interpretation of the result. The 
number / is called the multipole order, as it describes how many oscillations 
the field will experience in one 27 sweep of ¢. The free term Mij(s) is called 
the multipole strength, and the term 6), is called the multipole phase. 
Apparently, frequency l and radial power k are coupled: The lowest 
order in r that appears is l, and if the multipole strength is s-dependent, also 
the powers l + 2, L+ 4,... will appear. 

For a multipole of order l, the potential has a total of 2l maxima and 
minima, and is so often called a 2/ pole. Often Latin names are used for the 
2l poles, and they are listed in Table 3.1. 

In many cases it is very important to study the Cartesian (and hence also 
particle optical) form of the fields of the elements. We start with the trivial 
case with k = 1. In this case, the potential is V = M1, cos (ġ + 011) r. For 
0,1 = 0, we obtain V = Mj, - x, which corresponds to a uniform field in 
x-direction. For 014, = 7«/2, another important sub-case, we obtain V = 
—Mhi,1: y, which corresponds to a uniform field in y-direction. In both of 
these cases, the reference orbit is indeed a straight line only in the limit of 
weak fields. 
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TABLE 3.1: A list of multipoles 
l Leading Term in V Name 


0 Mo,o S) cos 9.0) 

1 Mii(s)cos( ¢+611)r Dipole 

2 Mo 2(s)cos(2¢+ 62,2)r? Quadrupole 

3 M3,3(s) cos(3¢+ 63.3)r?  Sextupole/Hexapole 
4 Maa(s)cos(4ó 4-044) r* Octupole 

5 Msas(s)cos(5ó 4-055) r? | Decapole 

6 Me 6(s) cos (60 + 05,,) r? Duodecapole 


3.1.2 Quadrupole Fields 


The case k = 2 leads to quadrupoles, and the potential has the form V = 
Mz > cos (2¢ + 02,2) r?. Particularly important in practice will be the sub-cases 
05.5 = 0 and 05. = 7/2. In the first case, we have 


V = M22 cos (26) r? = Moo (cos? $ — sin? $) r? = Ma» (s? — y’) ; 
and in the second case we have 
TN 2 : 2 
V = M»5cos (20 + z) r^ = —Mə 2 sin (2ġ)r 
= —M2 2 (2sin ġ cos 9) r? = —M2,2 - 2zy. 


All other angles 02,2 lead to formulas that are more complicated; they can be 
obtained from the ones here by subjecting the x, y coordinates to a suitable 
rotation. This again leads to terms of purely second order. 

Because the potential is quadratic, the resulting fields E or B are linear. 
Indeed, the quadrupole is the only s-independent element that leads to 
linear motion similar to that in glass optics, and thus has great importance. 

In the electric case, one usually chooses 05. = 0, thus having V = Ma a(x? — 
y?) and resulting in the fields 


Ez = —2M2 2 "T, Ey = 2M» "y. 


The fields extend throughout the length of the device, and thus provide strong 
focusing. Different from the case of glass optics, it turns out that the motion 
cannot be rotationally symmetric anymore. If there is focusing in the 
x-direction, there is defocusing in the y-direction, and vice versa. This effect, 
completely due to Maxwell’s equations, turns out to be perhaps the biggest 
nuisance in beam physics; i.e., if one uses piecewise s-independent particle 
optical elements, the horizontal and vertical planes are always different 
from each other. 

To make an electrostatic device that produces a quadrupole field, it is best 
to machine the electrodes along the equipotential surfaces, and utilize the fact 
that if a sufficient amount of boundary information is specified, the field is 
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y 


FIGURE 3.1: Ideal electrodes of an electrostatic quadrupole. 


uniquely determined, and hence must be as specified by the formula used to 
determine the equipotential surfaces in the first place. So in practice, the 
electrodes of an electric quadrupole often look as shown in Fig. 3.1. 
In the magnetic case, one chooses 05» = 7/2, thus having V = — M2 2- 2xy 
and resulting in 
Bz = 2M2 2 "Y, By = 2M» tX, 


and looking at the Lorentz forces that a particle moving mostly in s-direction 
experiences, we again see that if there is focusing in x-direction, there is 
defocusing in y-direction and vice versa. 


3.1.3 Sextupole and Higher Multipole Fields 


To study higher orders in k, let us consider the case k = 3. For 05,3 = 0, we 
obtain 


V = M; s cos (39) r? = M3 3 (cos? ¢ — 3 cos ósin? à) r? = M3 (3? — 3zy?).. 


In this case, the resulting forces are quadratic, and are thus not suitable 
for affecting the linear motion; but we shall see later that they are indeed 
very convenient for the correction of nonlinear motion, and they even have 
the nice feature of having no influence on the linear part of the motion. 
Another important case for 05.3 is 03,3 = 7/2, in which case one can perform 
a similar argument and again obtain cubic dependencies on the position. 

For all the higher values of l, corresponding to octupoles, decapoles, 
duodecapoles, etc., the procedure is very similar. We begin with the addition 
theorem for cos(19) or sin(l$), and by induction we see that each consists of 
terms that have a product of precisely l cosines and sines. Since each of these 
terms is multiplied with r’, each cosine multiplied with one r translates into 
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FIGURE 3.2: The s-dependence of the multipole strength. 


an x, and each sine multiplied with one r translates into a y. The end result 
is always a polynomial in x and y of exact order l. 

Because of their nonlinear field dependence, these elements will prove to 
have no effect on the motion up to order l — 1, and thus allow us to selectively 
influence the higher orders of the motion without affecting the lower orders. 
And if it is the crux of particle optical motion that the horizontal and vertical 
linear motion cannot be affected simultaneously, it is its blessing that the 
nonlinear effects can be corrected order-by-order. 


3.1.4 s-Dependent Fields 


In the case where there is no s-dependence, the potential terms that we have 
derived are the only ones; under the presence of s-dependence, as shown 
in eq. (3.4), to the given angular dependence there are higher order terms 
in r, the strengths of which are given by the s-derivatives of the multipole 
strength Mij;. The computation of their Cartesian form is very easy once the 
Cartesian form of the leading term is known, because each additional term 
just differs by the previous one just by the factor of r? = (x? + y?). 

In practice, of course, s-dependence is unavoidable: the field of any particle 
optical element has to begin and end somewhere, and it usually does this by 
rising and falling gently with s, entailing s-dependence as seen in Fig. 3.2. 
This actually entails another crux of particle optics: even the quadrupoles, 
the *linear" elements, have nonlinear effects at their edges, requiring 
higher order correction. The corrective elements in turn have higher order 
edge effects, possibly requiring even higher order correction, etc. In practical 
terms, charged particle optical systems are designed in such a way that the 
effect of the higher order field is smaller than that of the lower order field, 
which ensures that the iterative process converges. 

Without s-dependence, the case | — 0, corresponding to full rotational 
symmetry, is not very interesting since there will be no field left. This becomes 
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FIGURE 3.3: Scalar potential (solid), longitudinal (dashed) and radial 
(dotted) field distribution along the s-axis of a single (left) and dual (right) 
step in potential of a rotationally symmetric lens. 


clear if we rewrite eq. (3.4) for the case | = 0, which is 


MẸ (s) Ms (s) 
Mon, S) — — —3 = — A 3.5 
09) = TES CO Q^ (nl) C4) Sn 


When Mo (s) = 0, we obtain that V = Mo,o, which is independent of s and 
r. If we consider s-dependence, it actually offers a remarkably useful effect. 
While there is no r-dependence in the leading term, the contributions through 
the derivatives of Mo,o(s) entail terms with an r-dependence of the form r?, 
r*,.... Using eq. (3.5), we obtain the Taylor expansion of the potential, which 


is 


1 


= Mo,o(s) — = Mi (s)r? + Gy 


M (s)r* oe (3.6) 
Of these, the r? term will indeed produce linear, rotationally symmetric 
radial fields and lead to effects similar to those in the glass lens. In practice 
these fields are not very strong (proportional to MQ (s), compared to ME? (s) 
for the longitudinal field) and restricted to regions where the potential changes 
and are used in so-called weak focusing. In practice, potential changes often 
occur as transitions between regions of constant potential. This can be done 
as a single step as shown on the left of Fig. 3.3, or as a dual step as shown on 
the right of Fig. 3.3, where the latter has the advantage that no net change 
in potential occurs. 
The resulting fields are given by 
1 


E.(r,s) = ~My (8) +7 


3 
M§3(8)r? = mn 
1 jt 
E,(r,s) = = Mgo(s)r — z5 Mig (s)r? + 


The magnetic field components B,(r,s), B,(r,s) take the same form. Fur- 
thermore, there are usually quite large nonlinearities, and altogether these 
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FIGURE 3.4: Reference orbit of a bending magnet. 


devices are used mostly for low energy, small emittance beams like those 
found in electron microscopes. 


3.2 Fields with Planar Reference Orbit 


In the case of the straight reference orbit, we saw that Maxwell's equations 
entail a very clean connection between rotational symmetry and radial po- 
tential. As one may expect, in the case of a non-straight reference orbit, this 
is no longer the case. In this situation, Maxwell's equations have a rather 
different but not less interesting consequence as long as we restrict ourselves 
to the case in which the reference orbit stays in one plane. 


3.2.1 The Laplacian in Curvilinear Coordinates 


As it turns out, in this case the arguments to express the Laplacian in the 
new coordinates are similar to that in cylindrical coordinates. Let us assume 
that the motion of the reference particle is in a plane, and that all orbits that 
are on this plane stay in it. Let R(s) be the momentary radius of curvature 
as shown in Fig. 3.4. 


Then we have a situation very similar to cylindrical coordinates r, $, z 
centered around the momentary origin of R(s). In fact, setting h(s) = 1/ R(s), 
the particle optical coordinates x, y, s correspond to the cylindrical ones in 
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the following way: 


As we recall, in cylindrical coordinates the Laplacian had the form 


V( o alm ov md 10V eae 
PZ — Tür V Or r Ob Nr ôg 022' 


So we may expect that in particle optical coordinates, we in fact have 


PIC EPA TR Aine” + l : : a faa 
1+ hz Ox T) Ox 1+hzds \1+he ðs Oy? 


AV (x,y, 8) = 


A careful analysis based on the chain rule and determining the proper Jacobian 
reveals that this is indeed the case. The calculations are rather mechanical 
and not particularly interesting, but very involved [48], and we skip them for 
the purposes of this discussion. 


3.2.2 The Potential in Curvilinear Coordinates 


For the potential, we again make an expansion in transversal coordinates, 
and leave the longitudinal coordinates unexpanded. Since we are working now 
with x and y, both expansions are Taylor, and we have 


V —V(z,y,s Iy S 15 UE (3.7) 


k=0 l=0 


'This expansion now has to be inserted into the Laplacian in particle optical 
coordinates. Besides the mere differentiation, we also have to Taylor expand 
1/(1 + hz) : 

Toke =! (he) + (hx)? — (hz)? + 
After gathering terms and heavy arithmetic, and again using the convention 
that coefficients with negative indices are assumed to vanish, we obtain the 
recursion relation 


ük l+2 — — ay | — khay 44 + kh'ay 1) — Ak42,1 — (3k + 1) hay iu 
m 3khay 142 = k (3k mi 1) h?ay ns 3k (k Ex 1) h?ay 2142 
— k (k — 1 ay 11 — k (k — 1) (k — 2) ay 3152. (3.8) 


Although admittedly horrible and unpleasant, the formula apparently has the 
coefficient of highest total order k + l + 2 on the left hand side, and thus 
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recursively allows the calculation of coefficients. Indeed, the terms a;,o(s), 
aķ,ı(s) can be chosen freely, and all others are uniquely determined through 
them. 


To study the significance of the free terms, let us consider the electric and 
magnetic case separately. In the electric field, in order to ensure that orbits 
that were in the plane stay there, there must not be any field components in 
the y-direction in the plane corresponding to y — 0. Computing the gradient 
of the potential, we have 


k 
X x 

Ey (x,y =0)=- EET Ey (r,y 20)—5— J akir = 9, 
k i k 3 


and looking at Ey, we conclude that a;,4 = 0 for all k. So the terms ajo alone 
specify the field. Looking at Ex, we see that these are just the coefficients that 
specify the field within the plane, and so the midplane field determines 
the entire field. Furthermore, looking at the details of the recursion relation 
(3.8), it becomes apparent that all second indices are either | or | + 2. This 
entails that as long as aj, 4 terms do not appear, also ak,3, à5,5, ... terms do not 
appear. Indeed, the resulting potential is fully symmetric around the plane, 
and the resulting field lines above and below the plane are mirror images. 


In the magnetic field, the argument is rather similar. Considering the 
fields in the plane, we have 


E "x 
By (x,y 20) 2 - 5 aiat Bz (sy —0) - -  jakory T =p 
k f k l 


In order for particles in the midplane to stay there, we must have that B, 
vanishes in the midplane, which entails a;,9 = 0. So in the magnetic case, the 
coefficients aj, specify everything. These coefficients, however, again describe 
the shape of the field in plane, and so again the midplane field determines 
the entire field. In the magnetic case, the potential is fully antisymmetric 
around the plane, and again the resulting field lines are mirror images of each 
other. 


To summarize the findings, 


Electric field: akı =O for all k, azo specify everything. 
Magnetic field: ayo = 0 for all k, ax specify everything. 


To conclude, we note that it is possible to extend the entire discussion also to 
cases where the motion is not confined to a simple midplane. The derivations 
connected to this most general case become exceedingly complicated [49, 48] 
and go beyond what is appropriate for this book. 
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3.3 The Equations of Motion in Curvilinear Coordinates 


There are a variety of methods to derive the equations of motion in curvi- 
linear coordinates with the arc length s as the independent variable. It 
is conveniently done in the Lagrangian picture, in which one first expresses 
Cartesian variables by curvilinear coordinates and rewrites the Lagrangian. 
'Then one proceeds to a Hamiltonian through a Legendre transformation in 
the common way. In the Hamiltonian picture, it is then possible to perform a 
change of independent variable from t to s while maintaining the Hamiltonian 
structure [49]. 

While very illuminating, the Lagrangian-Hamiltonian mechanism is too 
involved for our purposes, and we thus follow a more straightforward, clas- 
sical way that leads to the same canonical equations of motion. For simplicity, 
we also restrict ourselves in that the reference orbit is allowed to bend in only 
one plane. 


3.3.1 The Coordinate System and the Independent Variable 


As a function of the arc length s, we first define the momentary curvature 
of the reference orbit as h(s). If the curvature is nonzero, the radius of cur- 
vature is then given by R(s) = 1/h(s). We begin by studying the bend angle 
that the reference orbit experiences as we move from position s to position s. 


We have FE > 
a= [48 - l m7] h(5)d3. (3.9) 


As described in eq. (1.1), in Cartesian coordinates, the equations of motion 
have the Lorentz force form 


E pEr eE], 


where É and B are the electric and magnetic fields, @ is the velocity, and 
q = Ze is the charge of the particle. Since the left hand side of the equations 
of motion contain momentum, it is often useful to express the velocity in terms 
of the momentum. From eq. (1.5), 


dr D 
f mg ucc e 
dt V p? + me 


which allows to maintain only the momentum p in the equations of motion. 
For the purpose of our derivation, we rewrite the equation as an integral 
equation: 
Q8) , 5 
P=) f Fos-mas-[ Faeas, 
t(s) s 
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where we have used t’ = dt/ds, and it is worthwhile to remind ourselves that 
the force F(s) still depends on both z and p. 

As we have progressed from s to 5, the orientation of our locally attached 
particle optical coordinate system has changed. It was rotated by the angle 
a of eq. (3.9). By using the rotation matrix 


cosa 0 sina cos f h(s)ds 0 sin f° h(s)ds 
R(s) = 0 1 0 |= 0 1 0 
—sina 0 cosa — sin We h(s)d5 0 cos f h(s)ds 


we have the momentum in the new rotated local coordinates pi(3) = (pz, Py, ps) 
as 


gs) = R(s) - ss) 
cos Hs h(s)ds 0 sin b: h(s)ds 


j (m "m (n+ f Fas). 


where we have expressed the last line in the integral form. In order to obtain 


the rate of change of the momentum pij, we differentiate with respect to s. 
Noting that 


evaluate at 5 = s and obtain 


p (s) = | ROI + RO Go + “Feytas) | 
0 0 h(s) 
= F(s)t' + 0 0 0 Ja (3.10) 
-h(s 0 0 


Note that the first term depends on the actual forces and the factor t’ accounts 
for the fact that we went to the arc length s as an independent variable. 
The second term is a pseudoforce due to the fact that we are located in a 
rotating frame. Indeed, for h = 0, we obtain the conventional result. We also 
note in passing that if we were to allow out-of-plane motion of the reference 
orbit, then the matrix R would depend on two curvatures. Unfortunately, in 
this case an additional complication arises from the fact that rotations around 
different axes do not generally commute [49]. 
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FIGURE 3.5: The curvilinear coordinates in the plane of the reference 
orbit. 


Next we make an observation regarding the rate of change at which dis- 
tances are covered at different positions x. Looking at Fig. 3.5, we observe 


dL R+2 


—— —124 hz. 
ds R E 


Using this and the components of the momentum, we obtain 


dr dL dz dx Dx 
Z = —— =(1+hz)— — (14 hz)— 
mo ee cpu 
dy | dL dy dy Py 
AL = (1 +ha)—2 = (1+ho)™. 41 
7o rap WR pedi ely 


For the time-of-flight, we consider the traveling distance divided by the 
velocity, and we obtain 


dx\* /dy\? (dL\? 1 p2 +p? 
ee we —) = -(1+h Z Y 41 
CONCI eye d 


(eh) P. (3.12) 


dt 
ds — 


where p = 4/p2 + p? + p? has been used. 

Altogether, we have so far obtained the equations of motion in local coor- 
dinates with s as the independent variable. From there to the particle optical 
variables, only a small step is left. We remind ourselves that the particle 
optical coordinates are (z,a, y, 5,1, 01 as listed in Table 3.2, where po and 
to are total momentum and time-of-flight of the reference particle; refer to 
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TABLE 3.2: Optical coordinates 


Coordinate Phase Space 

x Horizontal Position 

à = Pz / Po Momentum Slope 

y Vertical Position 

b — py/po Momentum Slope 

l — &(t — to) Longitudinal Time-of-Flight like Variable 
ô = (K — Ko) / Ko Energy Deviation 


eqs. (2.1) and (2.2). Likewise, the subscript 0 will be used below to indicate 
the respective quantity of the reference particle. 

In order to study relativistic effects, it is advantageous to introduce the 
relativistic measure 7, the ratio of kinetic energy to rest mass energy 


||. Ko(16) — ZeV | ZeV 


Le coptid A (3.13) 
MC 


MC 


where m is the rest mass. The quantity V is the change in energy that is 
incurred due to the passage through electric fields; it is given by 


V= SEO - üdt, (3.14) 


where 4 is the velocity, depending on position and time, of the orbit under 
consideration. In case the electric fields are time independent, the quantity V 
is merely the common electrostatic potential, which depends on the position 
coordinates (x, y, s). For time dependent fields, the quantity V explicitly de- 
pends on the specific time dependent orbit taken, which is of importance for 
the study of dynamics in RF cavities as discussed in Chapter 10. 

Since mc? represents the total energy, we have 


=1+7. (3.15) 


1 
w= lee 


Using these, we also have 


1 vant  wvm(Q-tm) 


a ? 


ys 1+ 1+ 


. =m(1+n). (3416) 


s ale 


MC mc ve 


As a first step, using eq. (3.12), we express the rate of change of the particle 
optical variable | = «(t — to) in terms of particle optical quantities: 


1 
z (ha) REP 1] £ faena) aw cis (3117) 
1+ 10 Ps vo 
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where the relation tọ = 1/vo is used because of psp = po. The term po/ps 
appearing above can be expressed by the optical coordinates a and b as 


—1/2 
Po _ Po NS (4 2 z) f 
— = = T a — b 
" peg eps WP 
24 —1/2 
= (2529.9) ; (3.18) 
no (2 + no) 
Next, by applying the Lorentz force law to eq. (3.10), we obtain 
00 A 
d (De 6 E 1 D 
A(z w æ) Forta] ooo ])Z 
ds \ Po po po Po -ho 0 J Po 
[ 24 Ü s x 
- Ze(É 8x B) E +n (2,02). (3.19) 
Po Po Po 
We here introduce the magnetic rigidity Xm and the electric rigidity Xe, 
p pu 


imr 7 Ae 


As we will see below, the magnetic and the electric rigidities describe directly 
to what extent the magnetic and the electric fields influence the geometric 
motion of the particles. The first term of eq. (3.19) can be expressed by using 
Xmo and xeo as 
Be nase, 3 
vot +u x —t. 
Xe0 Xmo 


= A o Be ta 2 
Ze(E ix B) — = E (E vx B) vot! = 
Po Povo 


From eq. (3.12), the factor vot’ in the electric term can be written as 


1 
ute ay Pest chay obe ata DU. 
v Ps Po V Ps 1+ 10 Ps 


Similarly, the factor vt’ in the magnetic term can be written as 


üt =" (tar) E =P tae) 2 = (1+ he) 2, 
U Ds p Ds Do Ps 
where v/v = p/p is used because of v || p. Thus, 
5 ENE 1 E B 
Ze (E - $x B) — =(1+he) SEMANA E 
Do 1+ "o Xeo Ds Po  XmoDs 


Continuing from eq. (3.19), we obtain 
4 (= Py e) 
ds \ po’ Po’ Po 


1 E > B 
a P caeno Bx E 4 a( 


= (14 hx) 
1+ no Xeo Ps Do  Xmo Ps 


Ps o. -E) 
Po Po 
(3.20) 
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Finally, we consider the change of the last variable in the particle optical 
coordinates, ó. Since by definition it describes the deviation of initial kinetic 
energy of the particle of interest, we have 


à — 0. (3.21) 


Note, however, that since 6 describes the deviation from the kinetic energy of 
the reference particle before the system, in case there is net acceleration or 
deceleration along the orbits, it may be desirable to periodically absorb the 
accumulated amounts in the orbit dependent path integral for V in eq. (3.14) 
into the variable ó. In the case the motion was merely through a static electric 
field, this will entail that ô will depend on positional variables. In the case of 
full time dependence as in the motion in RF cavities discussed in Chapter 10, 
after the renormalization, ó will depend on all particle optical coordinates. 


3.3.2 The Equations of Motion 


By observing that p/po = (a,b, ps/po), from eqs. (3.11), (3.17), (3.20) and 
(3.21), we obtain the equations of motion in particle optical coordinates: 


a(1+ he) 72 

1 E, Bs B s 
(1+ he) (Es Bn OBO Fe) ea 

1+ Xe0Ps  Xm0Ps Xmo Po 
b (1+ ha) E> 

m E B, B, 
eeu J Eup p Seg Se 

1 + 0 Xeo Ps Xmo Xm0 Ps 


1+ 
pas cena PA = 
1+ No Ds vo 


ó' — 0, (3.22) 


where we remind ourselves of the following abbreviations from eqs. (3.13) and 
(3.18), 


EE /2 
ZeV Po ( n (2 +n) 2 J ! 
= 1+6)-—>, — = | ———__ a —b : 
CUR Ds no (2 + no) 
and eq. (2.1), 
Yo 
= — V0 4 
1-4 yo 


Note that the factor «/vo in the equation for |’ can be expressed in terms of 
no instead of yo using eq. (3.15): 


K 14 fo 
Up 2+ 10 
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We observe that the horizontal (x-a) motion is affected mostly by E, and 
B,, and the vertical (y-b) motion is affected mostly by Ey and Bz, and it is a 
direct consequence of the Lorentz force law. When a longitudinal component 
of the magnetic field B, is present, it acts to mix the horizontal and the 
vertical motions through the B, dependent terms in a’ and 6’, which is even 
a linear effect. This phenomenon is readily observed in the spiral motion of 
a charged particle moving through a solenoid if any transversal component of 
the momentum exists, which is described by a and b in our case. 

A careful analysis of the equations of motion reveals that indeed if all the 
particle optical coordinates are small, so are their derivatives defined through 
the equations of motion; indeed, the system is weakly nonlinear. 


Chapter 4 


The Linearization of the Equations 
of Motion 


In order to develop a matrix theory of particle optics similar to the Gaussian 
theory in glass optics, we have to linearize the equations of motion. This 
procedure is rather similar to other linearizations in physics; in particular, 
it is very similar to the study of so-called small oscillations in mechanics. 
Since the solutions of linear systems depend linearly on the initial conditions, 
indeed the resulting transfer maps will be linear as needed. It is worth noting 
that, although a 6 x 6 matrix is required to describe the linear motion, only 
blocks of 2 x 2 and 3 x 3 are needed for decoupled linear motion. 

We begin the actual process of linearization with the linearization of the 
fields, which corresponds to quadratic potentials in eq. (3.7). We begin our 
discussion with the case in which the potentials on the reference orbit vanish, 
which describes the situation of electric and magnetic multipoles as well as in 
deflectors. The case of electric and magnetic lenses do require the presence of 
potentials on axis, and they will be discussed in detail below. 

In the electric case, let us assume that there is no potential on axis, i.e., 
ao,o = 0, and that in the midplane, we have 


Es = — Ezo (1 + nez). 


Because of the recursion relation for fields, eq. (3.8), we obtain an out-of-plane 
expansion of 
Ey = Ero(h + ne)y, 


as well as an electrostatic potential 


1 
V(z,y) = Exox + 5 x0 (nex —(h+ne)y"), 


which is chosen in such a way as to vanish on the reference orbit. 
In the magnetic case, let the midplane field be given by 


B, = Byo(1 + nz). 
Due to the recursion relation, we must then have 


By = Byonyy. 
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Before we even discuss linearization, let us consider the zeroth order of the 
motion: if the system is supposed to be origin preserving, then we must have 
from the equation of motion for a’ in eqs. (3.22) that 

Ezo " Byo 


—+——=h 
Xe0 Xm0 
which in a natural and expected way couples the constant parts of the fields 
with the curvature of the reference orbit. 
Now we begin our process of linearization of the equations of motion (3.22). 
It is easy to see that 


(4.1) 


, 


z'-—a, wy =b. 


We also obtain 


Z 
J.—140- — Ent, 
no Tome 
and after more complicated expansions 
1+ Z 
Th gee i E 
1-4 no 1-4 7o (1 + m) mc? 
2+7 no Ze 
=1+ ô — ———— Erot. 
2 + 7o 2 -F no (2 + m) mc? ane 


Similarly, by using v1 +u =; 14+ u/2, 1/(1 + u) 21 1 — u for small u, we 
obtain 


Ps _ n (2 4 n) a2 — b2 
Po no (2 + jo) 
1 No 1/1 1 Ze 
= pl EE ome —— Broz 
: ;( ;2-) (+ LP 2 
1+ 1 Z 
=] 1 + "o — ee ig, 
2-Fno  mo(2+ 10) me 


Note that the symbol “=,” means we are keeping terms up to first order. 
After lengthy similar arguments, we also conclude 


1 1 Ze K 
l= te - i + — — ——á5 E. J 
! (1 + no) (2 + no) no (1 + no) (2 + o) me? ^^ 
1 1 Z 
=1—|h ER E CHA zo| © + ———5, 
2-cTo mo (2+ mo) mc (2 + no) 
as well as 
E, B E, 1 1 Z 
deua. ou. diu EM. CE. D Nan Ze 
Xeo Xmo Xeo (14-mo)* | lo (2 + mo) mc 
E, 1 1+ 
T" 0 ; no ty 
Xeo (1+ o) 2 4 No 
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where the relation (4.1) is used, and 


Ey B 
b =; | D (h + ne) + Ben y. 


Xe0 m0 


To summarize, we have the linearized equations of motion as a set: 


x =1 Q, 
E, B E. 5 1 Z 
a’ = — h? + 0 qup e alle Oe EE s x 
e0 Xmo Xeo (14+19)* | no (2 + no) me 
Ey 1 1+ 
4 0 ; no ô, 
Xeo (1 +n) | 2+ o 
y = b, 
E; 
b 1 | 0 (h + Ne) + zai Y, 
e0 XmO0 
1+ 1 Z 1 
oe | asl eet s E 
2+10  mo(2-- mo) mc (2 +70) 


Now that the equations of motion have been linearized, they have to be 
studied for a variety of different cases. We begin with the simplest case. 


4.1 The Drift 


In the case of the drift, all fields are 0, and h = 0, so the linearized equations 
of motion have the form 


x’ =a, a’ =0, 
‘=), b — 0, 
1 
|! = — ——3, 6’ — 0, 
(2 + no) 


where of course only the last equation is of any real interest. These equations 
are trivial to integrate, and we obtain 


ty = frt UL, af = Gi, 
yf = yi + OL, by = bi, 
L 
lg ôi + lj, ôf = ĝi, 


(2+ 9)? 
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where L is the drift length, and they can be written in matrix form as 


Tf 1L0000 Ti 
af 010000 ai 
yp |} |001L00 Yi 
bs | 1000100 bi |? 
ly 00001D li 
Of 000001 ôi 
where 
D- L 
(2+ o) 


First, we observe that, as in the case of glass optics, the determinant is 
unity. We also observe that the matrix can be grouped to three blocks, corre- 
sponding to the x-a (horizontal), the y-b (vertical) and the /-ó (longitudinal) 
motions. So, in the linear approximation, those three submotions in the drift 
are decoupled, without having any mixing. This allows us to study the entire 
motion in each direction conveniently independently. Later, we often study 
the motion of a system in decoupled submotions. 

It is worthwhile to note that, when we take account of nonlinearity, even 
the drift motion is no longer simply linear, which may sound striking. This 
can be seen in the equations of motion (3.22), and the nonlinear effect comes 
from the factor po/ps, which contains the second order contributions of a and 
b. This effect is called the kinematic correction, and it also is responsible 
for nonlinear mixing of submotions in different directions even for the drift. 


4.2 The Quadrupole without Fringe Fields 


More interesting is the case of the quadrupole. Since the reference orbit 
goes straight, we have h = 0. 


4.2.1 The Electric Quadrupole 
For the electric quadrupole, from Section 3.1.2, we have 
V = M22 cos (20) r? = M,» (a? — y?), 


and 
Ez = —2M2 2 "T, Ey = 2M22 y 


while B = 0. If M22 > 0 for the positive charge beam, the field acts to 
focus the beam in the horizontal (a) direction, and defocus in the vertical (y) 
direction. The field description above corresponds to the case in the general 
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form that the constant part Ezo of E, is 0 and the factor of the linear term 


in z is Erone = —OE,/0x = 2M2 2. The equations of motion have the form 
2M: 
zg’ = a, a’ = 22 wa, 
Xe0 
2M: 
y =b, dE 22y muy, 
Xe0 
1 
i^e 5 ð =0, 
(2 + 70) 
where 


2M22 
w = | ——. 
Xe0 


Apparently we have sine-cosine solutions in the horizontal plane, and sinh- 
cosh solutions in the vertical plane. For the quadrupole with the length L, we 
have 


inwL 
zf = xi coswl + ai DS ; af = —wx;sinwLl + a; coswL, 
inhwL 
yf = yi coshwL + bj claus » bs = wyisinhwL + b; coshwL, 
L 
Lp = —— ——3ói t li ðf = ĝi. 
(2 + no) 


This can be written in matrix form as 


Le cos(wL)  sin(wL)/w 0 0 0 0 Ti 
af —wsin(wL)  cos(wL) 0 0 0 0 aj 
yp | 0 0 cosh(wL) sinh(wL)/w 0 0 Vi 
by 0 0 wsinh(wL) cosh(wL) 0 0 bil 
ly 0 0 0 0 1 D li 
Of 0 0 0 0 0 1 bi 
where 
2M. L 
w= 22 and D= 5 
Xe0 (2+ no) 


Similar to the case of the drift, the matrix can be grouped to three blocks, 
namely the horizontal, the vertical and the longitudinal motions. Again, this 
holds while we limit ourselves to the linearized motion. We observe that, 
as in the case of glass optics, the determinant is unity. Furthermore, note 
that if M5,» < 0, w is imaginary. In this case, the x- and y-planes exchange 
their roles, the quadrupole becomes focusing in the vertical (y) direction and 
defocusing in the horizontal (x) direction. To see the focusing action in the 
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similar style to the matrix form of the thin glass focusing lens, eq. (2.4), 
we consider a thin approximation of the quadrupole. While maintaining the 
integrated field strength, that is represented by 2M»: L, we make L — 0. 
Then, we have 


(ty uy) tios und 


This corresponds to a thin focusing lens with the focal length f as 1/f = 
(2M3.2/ Xeo) -L= w? : L. 

In various cases that will be studied in the following sections, we will observe 
a harmonic oscillator motion similar to the z plane motion in this section. In 
such a case, if the system is short it behaves like a thin focusing lens, and the 
focusing power can be obtained as 1/f = w? - L using the angular frequency 
w of the system, in the same way discussed here. 

It is also worthwhile to briefly mention the case of fringe fields. In this case, 
M22 changes as a function of s. The resulting ordinary differential equation 
(ODE) is still linear, which entails that the result can be written in matrix 
form, but in most cases is impossible to solve it analytically. 


4.2.2 The Magnetic Quadrupole 


In the case of the magnetic quadrupole, we have 
Vp = —2M2 2x "Y, Bz = 2M2 29, By = 2M2 22, 


while E = 0. This corresponds to the case in the general form that the constant 
part Byo of By is 0 and the factor of the linear term in x is Byonp = OB,/Ox = 
2M» 2. This results in the linear equations 


2M. 
x’ =a, d 2 2,2 Ww, 
Xm0 
2M. 
y-b V= Ayw, 
Xmo 
1 
T E ME 
(2 + no) 


Similar to the electric case, we have introduced 


2M22 
w= | ——, 
Xmo 


and the resulting transfer matrix is the same as in the case of the electric 
quadrupole. 
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FIGURE 4.1: A homogeneous magnetic dipole. 


4.3 Deflectors 
4.3.1 The Homogeneous Magnetic Dipole 


The next particle optical element we want to study is the magnetic dipole, 
consisting of a homogeneous, hence constant, magnetic field in the y-direction. 
We consider that the dipole element acts to bend the reference orbit by the 
bending angle ¢ with the bending radius Ro, so the curvature is h = 1/Ro. 
We also note that from eq. (4.1), h = Byo/Xmo. In terms of the quantities 
describing the linearized fields, we have 


Byo = constant, ny = 0, 


while E = 0. Keeping in mind magnet design, such a field can be obtained 
very schematically as shown in Fig. 4.1. 
Let us now consider the equations of motion; we obtain 


3 


12 
x! =a, a' =—h°r +h Jo 
2 + no 


1+ 1 
U =-h——Pr + — ó, 6-0. 
2+ no (2+ no) 


First we observe that if we choose h = 0, we obtain a’ = 0, and we have the 
same situation as in the case of a drift. But even for the case of h 4 0, the 
motion of the y-direction behaves simply like a drift, and we always have 


yf = yi + bi L, by = bi, 


where L is the arc length of the reference orbit in the dipole and L = Rog. 
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Next we observe that as always, 6 stays constant, and hence in the equation 
for a’ plays the role of a parameter, making the differential equation inhomo- 
geneous. Finally we observe that since | does not couple into the horizontal 
or vertical motion, we can solve the equation for l after the horizontal motion 
is analyzed by a mere integration. 

In order to solve the horizontal part of the motion, we first solve the ho- 
mogeneous part of the differential equation, which has the form 


and we obtain as a solution 
1 
v; =a, coswL + —ajsinwL = x; cos¢ + Roa; sing, 
w 
1 
aş = —wzisinwL + a; coswL = -R sin ó + aj cos ¢, 
0 
where we have used the angular frequency w, 
EN: 
c 


Altogether, we have a behavior not much different from a focusing quadrupole. 
In order to treat the inhomogeneity, we perform a so-called “variation of 
parameters," that is we make an ansatz of the form 


w=h 


1 
x (s) = %;(s)cos@+ Roà; (s) sin ó = T; (s) cosws + J“ (s)sinws, 
1 


-R7 (s) sin ó + à; (s) cos  — —wi; (s) sinws + a; (s) cosws, 
0 


where now the original parameters z;, à; are viewed as functions of s. Inserting 
into the differential equation, we obtain the following condition: 


& 
—— 
[VA] 
— 
Il 


1 
T; (s) cosws + —à; (s) sinws = 0, 
W 


1a 
—ui; (s) sinws + a; (s) cosws = A= hs No ôi, 


+ flo 


using the abbreviation A for the right hand side in the second equation. 
Rewriting in matrix form, this reads 


cos(ws) sin(ws)/w EN [0 
—wsin(ws)  cos(ws) à] NAJ’ 
Multiplying with the inverse matrix and integrating, we obtain 


Ti (s) = I (- sinus) Ads + zi = =A (cosws — 1) + zi, 


W 


3 1 
a; (s) = | (cosws) Ads + a; = —Asinws + aj. 
0 W 
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So the complete solution of the inhomogeneous part has the form 


1 1 |1 
x (s) = ES (cosws — 1) + s cosws + — | Asinus 4 «| sinus 
w w |w 
1 1 
= x, cos ó + —a; sing + — A (1 — cos), 
W W 
1 : Lan 
a(s) = —w | ~A (cosws — 1) + z; | sinws + |—Asinws + a; | cosws 
W Ww 


1 
= —wzr;sin $ + a; cos ọ + — A sin œ. 
w 


Thus we obtain 


1+ 
xs = zi cos + Roa; sing + Ro (1 — cos $) 5 sn 
T No 
L nising + a; cosó + sings 25 
af = —— rz; sin ai COS sin z 
T Ro 2+ no 


Finally we have to study the case of the time-of-flight part, which as we 
said before can be obtained by mere integration. We have 


L 
1+ 1 
0 2+ No (2 + no) 


NB ku P 1+ 
2 + no 2+ 


iy Roa; (cos ¢ — 1) 
no 


As a result, we see that all the final coordinates indeed depend on all initial 
coordinates in a linear fashion, and hence the relationship can be written in 
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terms of a transfer matrix. The general shape of this matrix is now 


cr coso Rosing 0 0 0 (ad) Tj 
af —sinó/Ro cosé 0 0 0 (ald) ai 
728 0 0 1 Rd 0 0 VW as 
by 0 0 0 1 0 0 b; 
ly (I) | (de 0 0 1 ayy] a 
5 0 p. 0 0 0 4 5; 


where Ro and ¢ are the bending radius and the bending angle, and the ab- 
breviated matrix elements are 


(249) = - (a) = 23 Pmj(-cwé), (að) = -(le) = m 7 sin 


"o t + 2 | 
o- sing| . 

2 - no 2 + No 

We observe that the determinant of the matrix is unity. Note that, while 
the y-b (vertical) motion is decoupled, the z-a (horizontal) and the l-6 (lon- 
gitudinal) motions are coupled. Specifically, the z-a motion depends linearly 
on ô. 

'The homogeneous dipole magnet we have considered so far has edges that 
are perpendicular to the reference orbit. So the region where the magnetic 


field is active corresponds to a sector of a circle, which is the reason such a 
magnet is often referred to as a sector magnet. 


(18) = —Ro 


4.3.2 Edge Focusing 


When the reference particle enters and exits a sector dipole magnet, the or- 
bit travels perpendicular to the entrance and the exit edges. When the magnet 
edge is not perpendicular to the reference orbit, additional focusing and de- 
focusing effects act on the beam, which is called edge focusing. The angle 
difference from the perpendicular sector magnet case is called the edge angle. 
Edge focusing is frequently used on purpose to modify the linear properties 
of the motion, or as a consequence of convenience in manufacturing since it is 
particularly simple to use a rectangular shape for the magnet, which leads to 
the so-called parallel-faced dipole. We now study the effects of edge focusing 
using the matrix form, and compare the result with the sector dipole. 

We measure the edge angle a such that the rectangular dipole would have 
positive edge angles. So, when a > 0, a particle that enters or exits the 
magnet at a positive x location experiences a lesser amount of the bending 
magnetic field compared to the reference particle. Compared to the sector 
dipole magnet, this means that the edge line tilts inward for positive x as 
shown in Fig. 4.2. 
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Entrance Exit 


FIGURE 4.2: Entrance and exit edge lines of a dipole magnet with edge 
angle a. 


When the edge angle a is sufficiently small, the effect can be approximated 
as an impulsive effect that changes only the horizontal and the vertical angles 
of the particle orbit but does not affect position nor the longitudinal motion. 
The impulsive style of treating such an effect is called “kick,” and the kick 
approximation is sometimes used when studying beam optical systems in lin- 
ear approximations in a similar way as it is in glass optics in the use of the 
thin lens. 

As we will explain in the following, in the kick approximation, the effect of 
the edge angle a for the homogeneous dipole magnet acts to change only a 


and b via 
x, tana u yi tana 


Ro ^C 


where Ro is the bending radius of the homogeneous magnet, and 


(4.4) 


Gf = Qi 


1 Byo 


Ro Xmo. 
'The same expression applies to both the entrance edge and the exit edge. 
Using the abbreviation 
T = tan a Rio, 


the matrices of the horizontal (z) and the vertical (y) kicks by the edge angle 
a are described as 


Recalling the situation of thin glass lenses, when o > 0, the vertical kick 
acts to focus the beam, and the horizontal kick acts to defocus. Combining 
the horizontal and the vertical kicks, the effect of the edge is that of a thin 
quadrupole of the strength —T, where always one of the directions experiences 
focusing, and the other defocusing. When the sign of a is opposite, the effect 
also becomes opposite. 
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FIGURE 4.3: Mechanism of edge focusing for the horizontal plane (left) 
and the vertical plane (right). 


We use a geometric argument to explain the horizontal kick. For the vertical 
kick, we use a step function to model B,, and use Maxwell’s equation to derive 
B, which affects the vertical motion. 

We have a homogeneous bending magnet with the entrance edge line tilted 
by the edge angle o as shown in the left picture of Fig. 4.2. Using the standard 
step function H, also often referred to as the Heaviside function, the vertical 
field component By can be expressed as 


By(z,y,s) = ByoH(s — z tana), 


where B,o is the constant field of the main part of the dipole. The Heaviside 
step function H is related to the Dirac delta function ó as 


moy- f^ sys -f forz >0 


0 forz«0' 


+oo forr-0 x 
49-1 0 forz Z0 O(a) =1 for anye>0. (4.5) 


=e 

The B, expressed above is of course an idealized situation. In reality there 
is no magnetic field that can fulfill the above expression while satisfying 
Maxwell’s equations. 

Now consider a particle approaching to the entrance of the magnet paral- 
lel to the reference orbit, but positioned at x. As seen in the left picture of 
Fig. 4.3, the entering of this particle is delayed by the distance xtana. In 
the meantime, the reference particle travels through the magnet for this much 
of arc length, experiencing a deflection angle amounting to 8 = x tan «/ Ro. 
When observing the situation in the particle optical coordinates that are at- 
tached to the reference particle's motion, the particle of interest located at 
the position x appears to have experienced a change in the direction of mo- 
tion by +a tana/Ro. Note that the picture in Fig. 4.3 is drawn exaggerated 
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to emphasize the relevant points. The same explanation applies to the situ- 
ation at the exit, where the change in the traveling direction appears to be 
+z tan a/ Ro. Thus, the formula for a; in eq. (4.4) has the +z; tana/Rp term 
for both the entrance edge and the exit edge. 

Next we consider the equation for b’ in the set of equations of motion (3.22), 
and observe that the B, term is the leading term to affect the vertical motion. 
As seen in Fig. 4.1, there exists a non-vertical field component around the 
edges of the magnet off the midplane, i.e., when y 4 0. Also, see the right 
picture in Fig. 4.3. From Maxwell’s equations (3.2), we have the relation 


0B, OB, 
Oy ðr’ 


and using this, we can derive B, as 


OB o 
B,(x,y, 8) ET = J [Zro — xtana)| dy 
= —Byotan aó(s — x tano): y. 
Thus, the equation for b’ of (3.22) becomes 


B B 
y = —* =- tanaó(s — z tana): y, 
Xm0 Xmo 
and in the impulsive or kick approximation we obtain 
tano 


Ro 


B 
bp = bi — ZM lana f (s — ztana)ds -y =b; — 


Xm0 


Yi. 


Now, at the exit side, having the opposite sign for the step function, By is 
expressed as 
By(z,y,s) = ByoH(—s — z tana), 


resulting in 
B,(x,y, 8) = —Byotanaó(—s — z tana): y. 


And, we obtain the same result as for the entrance case, namely 


tano 


B 
by bio D tana | (s - ctanads yc hi - R Yi- 
0 


Xm0 
We note that the rise of B, is caused by the mere tilting of the edge line; 
thus a longitudinal component D, also exists, and it can be derived in a similar 
fashion. But the B, dependent term in the b’ equation of (3.22) also depends 
on a, turning it to be a nonlinear term; thus we do not consider it for this 
linear kick approximation. Another note is that in the homogeneous sector 
dipole magnet, B, does not exist because a = 0. 
Since the rectangular dipole is rather commonly used, it is worthwhile to 
calculate the total transfer matrix of a rectangular dipole by combining the 
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edge focusing and the main part of the dipole using eq. (4.3). In this case, 
the edge angles of the entrance and the exit are the same, namely half of the 
bending angle and thus $/2. We study the x-a-I-6 block and the y-b block 
separately. Below, the abbreviation T = tan($/2)/ Ro is used. 

First, we calculate the combination of the main part and one edge for the 
x-a-l-6 block: 


cosó Rosing 0  (m|ó)ai 1000 

Hate | = sind/Ro | cosó 0 (a|ó)ai T 100 
P * (Ua) ai (I|a)ai 1 (1|ó)a; 0 0 1 0 

0 0 0 1 000 1 


cos¢+TRosing Rosing 0 

_ | -sino/Ro-Tcosó  cosó 0  (a|ó)a 
(H|z)a; + T (Ha)ai (lla)ja; 1 
0 0 0 


where the (1,1) and the (2, 1) components of the matrix multiplication are 
simplified to 1 and —T as follows. 


cos 9 + T Rosin ó = cog? £ — sin? 3 tnt . 2sin S cos $ =; 
— sin ġ/ Ro + T cos ó = -T [25i $ cos $ /tan $ — cos = aÊ + sin? t] = T: 
So, we obtain the matrix of the x block as 
1000 1 Rosing 0 (a|d)ai 
Xr, = MOMEN = T100 -T cosó 0 (ald)ai 
0010 (I|z)ai + T (1|a)ai (l|a)ai 1 (1|ó)a; 
0001 0 0 0 1 
1 Ro sin $ 0 (x|d)ai 
Hu 0 1 0 T(zx|ó)ai + (a|ó)ai 
T(la)ai + (]|x)a: | (aja; 1 (1ó)ai l 
0 0 0 1 


where the same simplification happened for the (2,2) component, and 


14 


(z|9)ai = 5 


7? Ro (1 — cos), 
no 


14 2 
"o s-( 2) singl , 
2+ No 2+ no 


(1|d)ai = —Ro 
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and 
T(z|ô)ai + (alð)ai = — [T (1]a)ai + (lo:)ai] 
1 9 1- n L+. 
= — tan — is ; 
Re omms of Sg. ep 
jak ju 
= ao [anf (1 — cos $) sing = T nf. 


The matrix of the y-b block is 


Xp 1 0\ /1 Rod Loy. 1— TRoó Roó 
GET M 0 1 -T 1} (-T(2—TR) I—TR]' 


where 
1 Q Q 
1- TR) 2 1- ótan$, -T(2—T Ryd) = —F tan $ |2- otan $]. 
In summary, we obtain 
LF 1 Rosing 0 0 0 (ad) Ti 
af 0 1 0 0 0 (ald) ai 
up |_| 0 0 1-ómn(ó/) Roe 0 0 ||w 
by 0 0 (b|y) 1—¢tan(¢/2) 0 0 bi |? 
Ip (liz) (a) 0 0 1 (ys) [d 
Of 0 0 0 0 0 1 ĝi 
where the abbreviated matrix elements are 
1 @ Q 
bly) = —— tan — |2 — ġ tan — 
(oy) = -F tan $ f2- otan $|, 
1+ 1+ 
(z|6) = (lla) = I7 R (1 cos¢), (alô) = (le) = 5+ . 2 tan £, 
2+ 70 2+ n 2 


(10) = —Ro 


7) 1+ Y 
0 + no : 
o- ( ) sind] . 
2+ No 2+ No 
Note that the determinant of the matrix is unity, which also can be deduced 
from that the determinant of all the contributing matrices is unity. 
To conclude, let us compare the characteristic effects of the rectangular 


dipole and the sector dipole in the limit of small deflection angle ¢. The x-a 
matrix and the y-b matrix of both dipoles can then be approximated as 


ipole: P 1 Rog j 1 Roó 
Sector dipole: Mza > ee 1 ) , Mp> G E 
Rectangular dipole: Mya > (o 2 , Myo ee a 


Thus we have the interesting effect that the characteristic behavior in the 
horizontal plane and the vertical plane is exchanged. 


82 An Introduction to Beam Physics 


weaker field 


FIGURE 4.4: Aninhomogeneous sector magnet. 


4.3.3 The Inhomogeneous Sector Magnet 


In the case of an inhomogeneous sector, there is a magnetic field that is 
constant in s-direction, but not constant in the x-direction; rather it has the 


shape 
x 


From the recursion relations for the fields, we infer that the corresponding 
horizontal field is j 

By = —Byo s "Ro 
ny in eqs. (4.2) is n; = —n/ Ro, and h = 1/ Ro. In general terms, such a field 
is obtained by changing the distance between what generates the fields (coils 
or iron) as a function of z, similar to what is shown in Fig. 4.4 for the case 
of n > 0. 


We have the linearized equations of motion as 


3E 1+ 
x =a, a = -(n?- e rts 2056 = —h*(1—n)z-Fh ly 


Xmo Ro 2+ 70 2+ 
1 1 Byo n 
y =b, DUE NM E DE 
Xmo Ro 
1+ no 1 


Fece 
2 + no (2+ m) 


We observe that the horizontal motion is similar to the case of the homoge- 
neous sector dipole, except that the strength of focusing now also depends 
on n, the field inhomogeneity. Different from the homogeneous sector dipole, 
there is now an effect in the vertical direction, which can be either focusing 
or defocusing, depending on the sign of n. 

The solution of these equations of motion proceeds in the same way as 
before, first solve the homogeneous system, then address the inhomogeneity 
arising from 6 via variation of parameters, and finally solve for | by a mere 
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integration. In horizontal and vertical directions, the homogeneous solution 
corresponds to harmonic oscillators with frequencies 


We = hy1-m, wy = hyn. 


For 0 < n < 1, the magnet is focusing in both planes. An interesting case 
occurs for n = 1/2, in which case the magnet focuses x and y identically and 
represents a nice equivalent of the glass lens. 

The remainder of the derivation is tedious algebra, and we will only list the 
result here. 


LF (a|xz) (xla) 0 0 O (ad) Ti 
af (a|r) (ala) 0 0 0 (ald) m 
yp | 0 0 (yy) (wb 0 0 V |. (46) 
br 0 0 (by) (bb 0 0 b |? cC 
n (|y) (ja) 0 0 1 (o) li 
à, 0 0 0 0 0 1 à; 

where 


(x|x) = (ala) = cos (vi — nó) ; 
Gia E E dues eT 


V1-n Ro 

(uly) = (blb) = cos (V6), 

Ru a va, 
(yib) = 7 sin (Vn$) (bly) = —A sin (Vn$) 
(218) = =a) = 529909 [1 — eos (V7 n)]. 
(aj) = - (lx) = se sin (VIn), 

(^p (lt \ rien al E La (MT 
CY pple glen 


and the determinant is unity. 


4.3.4 The Inhomogeneous Electric Deflector 


Rather commonly known is the motion of a particle in an electric capacitor. 
Neglecting fringe fields, it follows a parabola as shown in Fig. 4.5. For particle 
optical purposes, such an arrangement is not particularly suitable for two 
reasons. Firstly, the reference orbit has a curvature that depends on s, which 
makes the differential equations non-autonomous. Secondly, the potential 
along the reference orbit changes with s, which complicates the dynamics. 
Both of these problems do not appear if instead of a straight capacitor, one 
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parabola 


FIGURE 4.5: An electric capacitor consisting of two parallel plates. The 
orbit of a particle is parabolic. 


Ro 


FIGURE 4.6: A concentric electric deflector. 


chooses a curved one in such a way that the reference orbit is concentric 
together with the plates as shown in Fig. 4.6. 


To obtain the linearized equations of motion for such a device, we first 
observe that 


while B = 0, and using eqs. (3.16), 


(2 2 2 
Povo el Oa -¥ tio +m) +10) _ me no ( +) 


AOT ES T 1+ 0 Ze 1+ 0 


We describe the linearized electric field using the field inhomogenuity n, sim- 
ilar to the magnetic case, as 


hence ne in eqs. (4.2) is given by ne = —n/Ro, and h = 1/ Ro. The linearized 
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equations of motion are 


1 1 1+ 
az’ =a, a =—h? gi n + | OP Ee Mg 
(1 + mo) (1+ no)" | 2+ flo 
y =b, v =h? (1—n)g, 
1+ 1 1 
|,--hLLB |1 + ——, ] 2 + ——6,_ 6-0. 
2 + No (1 + 70) (2 + no) 


Since the electrostatic deflectors are used primarily for low energy electrons 
and ions (usually below 100 keV) due to the difficulty of achieving high static 
voltages, the particles are non-relativistic, i.e., ro < 1. As a result, the equa- 
tions of motion can be simplified to a more familiar form 


r =a, a! = —h? (8— n) z + hô, 
1 
y' =b, b = =h? (n—1)y, l = -he + 78, ó' = 0. 


We observe that for 1 < n < 3, both x and y planes are focusing, differ- 
ent from the case of quadrupoles where always one plane defocuses; but the 
amount of focusing in the x and y planes is different. Indeed, similar to the 
inhomogeneous dipole magnet, the transfer matrix is 


Tf (a|z) (xla) 0 0 0 (a|d) Li 

af (alx) (ala) 0 0 0 (ad) m 

us| 0 0 (wy) (wb O0 0 Yi 

b, 0 0 (by) (bb 0 0 bi | 

lf (jy) (ia) | 0 0 1 (I8) li 

T 0 0 0 0 0 1 5; 
where 


(z|x) = (ala) = cos (v3 — nó) ; 


(xja) = asin (V3 — n$), (ala) = — ya ” sin (V3 — ne) ; 
(yly) = (blb) = cos (V/n — 19) , 

Qi) = sin (VETTE), (Bly) = -V sm (Vn 1o). 
T =a 2 te Cu mms 

(alo) = —(l|x) = ain (V3 — n9), 

(18) es in ( 1 -ie- Gon (79) ; 


(3— 
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1 $ 


FIGURE 4.7: An electric deflector with cylindrical plates. 


LLNS 
n | 


FIGURE 4.8: An electric deflector with spherical plates. 


and the determinant is unity. 

So far, no assumptions have been made about the vertical shapes of the 
electrodes, and in fact a variety of choices exist. Two common situations are 
the cylindrical plates and the spherical plates, as shown in Fig. 4.7 and Fig. 
4.8, respectively. 

In the case of the cylindrical field, we have from Gauss’ law that E c 1/R, 
which implies the expansion 


Ro x 
E.-——BEL-————-i1-—-EBÀE|1—-—], 
? Roc : y x) 


and hence corresponds to n — 1. The transfer matrix is 


cf cos(V/2¢) (Ro/V2)sin(V2¢) 0 0 0 (ald)\ /zj 

af —(V2/ Ro) sin( 249) cos(/20) 0 0 0 (ald) || a 

yp | 0 0 1 Rod 0 0 Yi 

bp | 0 0 0 0 0 b; 

ly (Ia) (lja) 0 1 (18) || L 

Of 0 0 0 0 1 ĝi 
where 


aaa A [1 — cos (V26)], (als) = -d3 = 2s on (vae). 


/2 
O EE Fa E RE (v89)] 
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For ó = «//2 = 127.28, the deflector is imaging in the horizontal plane, 
which has been used as energy spectrometers. 
In the spherical case, we have E œ 1/ R? and thus 


R : x 
Ez = — Ero 0 —1 —Exz, 1—-2— ’ 
Rota Ro 


and n = 2. The transfer matrix is 


LF cosó Rosing 0 0 O0 (a|d) Ti 

af —sinó/Ro cosó 0 0 0 sing aj 

Uf 0 0 cosó Rosing 0 0 Yi 

by | 0 0  —sinó/Ro coó 0 0 be 

ly —sing (lja) 0 0 1 (lô) li 

5 0 0 0 j^. p. ef 5; 
where 


(2/8) = —(iJa) = Ro(1-cosd), (16) = —Ro (16 : sind) | 


When ¢ = m = 180°, this deflector forms a simultaneous image in both plane, 
also known as a stigmatic image. It has been and still is widely used as 
an energy spectrometer. It is also called a hemispherical analyzer, which is 
the main workhorse in the field of angle-resolved photoemission spectroscopy 
(ARPES). 


4.4 Round Lenses 


We now address the important class of so-called round lenses, which owe 
their name to their rotational symmetry along the beam axis. Electric round 
lenses are usually made of arrangements of rotationally symmetric metallic 
plates or tubes concentric with the reference orbit, each of which is held at 
a certain potential; and magnetic round lenses are usually made of solenoids 
carrying current and concentric with the reference orbit. 

'The rotational symmetry apparently entails that any fields in x and y di- 
rections are equal, and Maxwell's equations immediately show that this can 
only happen if there are also fields in the direction of the axis, which requires 
that the potential changes along the reference axis. Specifically, the potential 
for the rotationally symmetric case is described in eq. (3.6), and is given by 


1 1 
V =9 Vo (s) = ridi (s) r? =2 Vo (s) — 2 (s) (x? + y’) 
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up to second order in r. Therefore, the electric field, linear in x and y, is given 
by 
1 1 

Es zz 3 o (s) T. E, —1 3o (s) y; Es =1 -V (s) , 
where E, and E, can also be expressed in terms of E, as 
1 
2 


Fully analogously, we obtain in the magnetic case that 


1 
Ey =1 -3E5 (s) x, Ey =1 — E; (s)y. 


1 1 
Bz =r o (s) T, By =f zo (s) Y, D; = -V (s) ’ 


where B; and B, can also be expressed in terms of B, as 


1 1 
Be =1 -32s (s)z, By=1 -3Ps (s) y. 


We note that the expansion order listed in the subscript of the equal sign 
here in this section only applies to the variables r, x, y, a, b and 6, while 
the dependence on s is explicitly retained and not expanded. Fig. 3.3 shows 
typical resulting electric and magnetic scalar potentials as well as the resulting 
radial and axial fields for such cases. 

Before embarking on detailed studies of the dynamics in electrostatic and 
magnetic round lenses, we illustrate one of their important characteristics. We 
determine the average radial electric or magnetic field along a straight line a 
fixed distance r away from the center. We perform the averaging from —S to 
S where S is chosen large enough that all fields vanish at +5. We obtain 


S S 
Eds- | B,ds 


" 1 1 1 / / 
= i. =V (s)rds = z" [Vo (S) — Vo(—S)] = 0. (4.7) 


So all radial field components average out to zero. This is in stark 
contrast to for example the electric and magnetic quadrupoles or the combined 
function bending magnets, where the integrand of the radial field is constant 
throughout the integration. 

Consider now the case of a thin round lens in which a particle does not 
change position much, similar to the situation in the idealized thin lens. In 
this case the average in the above field integrals will be responsible for the 
directional offset the particle experiences, and so this offset is zero. Thus any 
focusing action the particle may experience must come through secondary 
effects and is the result of a then incomplete cancellation of radial field con- 
tributions. T'his situation is sometimes referred to as weak focusing. These 
effects will be studied in detail below for both the electrostatic and magnetic 
cases. 
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Since the case of rotationally symmetric s-dependent potential is not cov- 
ered by the assumption used for the linearized equations of motion (4.2) de- 
rived in the beginning of this chapter, we go back to the general equations of 
motion (3.22) in Chapter 3. In this case, h — 0, and hence the set of equations 
is given by 


=a a= 1i Ee Po - By | Bs Po 
Ps ° 1 +o Xe0 Ps  Xmo  XmoPs ` 
1 _ Po 2 1+ Ey po Bz Bs po 
yapa a p eU Se GET 
Ps 1 +o Xe0 Ps Xmo XmoPs 
1 
E Haaa) SM 5’ — Q, (4.8) 
1+ No ps vo 
where 
ZeV Do n(2+n Ta 
n =m (1+8) —— 7 e -( Shack) PINE NE 
mc Ds no (2 + jo) 
M SL ou _ Povo 
vo 2+ no Amo Ze! «en Ze 


In the following subsections, we study the two important regularly employed 
classes of lenses, the magnetic round lens and the electric round lens. As we 
will see, different from the cases of the electric and magnetic quadrupoles, 
their focusing properties arise from different mechanisms, and the magnetic 
and electric round lenses require a quite different treatment. 


4.4.1 The Electrostatic Round Lens 


Electrostatic round lenses are arrangements of metallic electrodes with ro- 
tational symmetry that act as equipotential surfaces and thus determine the 
on-axis potential. Electrostatic round lenses are frequently used in electron 
microscopes and also in the shaping of low-energy beams near the source. For 
a rotationally symmetric electrostatic lens, we have the electric field, linear in 
x and y, given as 


Es = VW (s)m Ey— Vo (s) Es= -Vo (s), 
and the corresponding electrostatic potential 


V=Vo(s)-5 (5) (a? + y?) 
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Applying these to eqs. (4.8), we have 


1 1 pol 
x = Poy a! = amd) EU (s) x, 
Ds 1+ 0 Xeo ps 2 
147-5 1 pol 
y 2B vsti (s)g, 
Ds 1+ 0 Xeo Ds 2 
1 
pe Ha 4) 5 à =0. (4.9) 
1+ No Ds vo 


The z-a part and the y-b part are decoupled, and they have the same form, 
so we only need to study one of them. 

For the further study, it is convenient to express the a’ equation above 
in various ways. Using the relation p/v = m(1+7) from eq. (3.16) and 
v/v = p/p because of v || p; we can write 


a= = 
1 + no Xeo Ps 2 R 


1 1 1 1 
Pat Eie o ea EDUC (4.10) 


We note that in linearization, vs, the s component of v, equals the velocity of 
the reference particle. Furthermore we observe that the focusing effect rests 
on the assumption that traveling through the potential leads to appreciable 
changes in velocity, which, for practically achievable voltages, limits the use of 
the effect to the near non-relativistic regime. So in the following we perform 
our argument in the non-relativistic limit and have 


v) — y2Ko/m — VKo 


v,  J/2K(s)m  4/Ko-— ZeVo(s) 


Here K(s) denotes the momentary kinetic energy of the reference particle, 
which of course changes as a function of position in the lens due to the change 
of Vo(s), while Ko is the constant kinetic energy of the reference particle before 
the round lens. 

'The above differential equations are all that is needed to determine the 
transfer matrix for the linearized motion of an electrostatic round lens with 
a certain potential distribution Vo(s). However, in the following, we try to 
obtain a better understanding of the situation, and in particular we show the 
reasons why these lenses are generally focusing. 


4.4.1.1 Hard Edge Fringe Fields 


As a first step towards understanding the behavior of electrostatic round 
lenses, we discuss the fringe field effects appearing in an abrupt transition into 
a region of axial field from a field-free region. In practice this situation arises 
when transitioning through a hole of small aperture in a large charged metal 
plate. When passing through the plate leading to the idealized hard edge at 
s = 0, E, and E, are expressed in terms of the Heaviside step function H and 
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the Dirac delta function 6 defined in eqs. (4.5) via 
E,(s) = EoH(s), E! (s) = Eod(s), 
which in terms of potentials corresponds to 


Vols) = V — sEoH(s), Vo(s) = — EoH(s), Vo (s) = —Eoó(s), 
where we assume that the transition happens at an initial potential V, and we 
use the impulsive kick approximation in a similar manner as applied to the 
dipole edge focusing in Section 4.3.2. The position x is unaffected over the 
infinitely short transition at s — 0, and we obtain 


pages je T VEO a s 
2Xe0 Jo- y Ko — ZeVo(s) 
Eo v Ko u v Ko 


44——— ——————— — 08 + tia R; 
2Xe0 V Ko — ZeV v Ko — ZeV 


where we use the abbreviation 


= 04 — 


Eo 
2Xe0 l 


In an analogous way we can treat the transition from a region with field Eo 
into a field free region, and obtain 


E.(s)— EoH(-s)  E,(5) = —Eoó(5), 
where the exit is at 5 = 0, so that we obtain 


Eo dm = ai — ee LCS 
ioe VKo — Zey | VESSAEGEC 


To summarize, the 2 x 2 transfer matrices for (x,a) at the entrance and the 
exit are given as 


af = Qi + tiz 


Bs 1 0 ^ 1 0 
Nr» = Mort = 
» Ko/(Ko — ZeV) 1 J e Ko/(Ko — ZeV) 1 ) 
(4.11) 
where a = — Eo/(2x«eo) is used. We emphasize that V is the momentary value 


of the potential, and V appearing in M» may differ from that in Mo, 

Overall we have kick effects that, similar to other fringe field cases, leave 
the position unaffected, but change the directions by an amount proportional 
to position. For a particle of positive charge, stepping into a positive field 
from a field free region leads to a focusing effect. Changing the sign of the 
field or stepping out of a positive field leads to a defocusing effect. 
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4.4.1.2 The Mechanism of Focusing 


We now consider in a very general manner the situation of a weak or short 
electrostatic lens. We observe that in eq. (4.7), one important effect has been 
neglected, namely the fact that the particle necessarily gains and loses energy 
as it travels through the lens. As a matter of fact, any transverse field bends a 
particle more readily when it is applied where the particle has lower velocity. 

Consider a weak lens that consists of a potential that in the region from 
—S to +S is first constant, then rises, then plateaus, and then falls off to its 
original constant value, similar to the case shown on the right in Fig. 3.3. 
Necessarily the regions of positive second derivative appear near the minima 
of the potential function, while the regions of negative second derivative cor- 
respond to maxima of the potential function. So while traveling through the 
lens, the particle is more sensitive to the focusing fields than to the defocusing 
fields; and even though the focusing and defocusing portions cancel, the net 
effect is that the orbit experiences focusing. To observe this quantita- 
tively, we transform the integral in eq. (4.10) via integration by parts, and 
obtain 


S 
Ti V Ko y! (s) Xi 8 V KoZeVg3(s) 


2xeo / Kg — ZeVo(s) ° E 4Xe0 -s [Ko — ZeVo(s)]? 


The first term vanishes, and using Xeo = povo/(Ze), which non-relativistically 
reduces to 


af = aid Vo (s) ds. 


2Ko 


=> 4.12 
oa (4.12) 


Xe0 = 


the expression takes the form 


a= f Inm 
af = Aj — 22; NEC C SAT 
4 _s | Ko — Zev (8) 


Since the integrand involves only non-negative terms, the integral itself 
is non-negative; and if there is any place where the potential on axis Vo(s) 
actually changes, then the integral is actually positive. Thus the change of a 
is negative, corresponding to focusing. 

As we have seen, other than restricting ourselves to a thin or weak lens, 
this result has been obtained by mere manipulations of the integral without 
any assumption of the specific form of Vo(s) except for its constancy at +5. 
This is actually a small example of various similar manipulations that have 
been developed in the past; for example one can show the focusing property 
even in the extended case [60, 67]. More impressively, conceptually similar but 
practically more involved arguments can be used to prove Scherzer's theorems 
[62] about signs of higher order aberrations. 

'The resulting formula can be used to obtain an approximation of the focal 
length of a thin or weak electrostatic lens. However, other coordinates than 


[v 23 


EE 


The Linearization of the Equations of Motion 93 


our x and a are used frequently in the electron optics community; in particular, 
the coordinate x is scaled to z by a factor depending on the kinetic energy of 
the particle. This leads to less change in the value of the position coordinate 
T as we travel through a lens. This nonlinear transformation leads to the 
approximation of z—constant being better than that of z—constant, which in 
turn leads to better estimates for focal length. There is a large amount of 
work on this topic, but we forgo the details and refer to some of the literature 
[56, 60, 67]. 


4.4.1.3 The Plate Lens 


We now address a particular type of lens for which it is possible to derive 
the transfer matrix analytically. We consider combinations of individual plates 
placed perpendicular to the optical axis, each of which has a small hole in its 
center through which the beam travels. Each of these plates represents a hard 
edge fringe field as discussed above. 

The plates are held at different potentials and form equipotential surfaces. 
Assuming that the plates extend to infinity, the electric field between two 
successive plates is that of a common plate capacitor, and it points in the 
direction of the reference axis. 

For any electrostatic lens it is desirable that the field far away vanishes. 
Compared to the case of the hard edge fringe field, this requires the use of at 
least two plates. Assuming the field between the plates to be Eg and their 
distance to be S, i.e., 


Eo for0<s<S 
EG) - | 0 fors<0,5s>S8”’ (4.13) 
the potential at the second plate is V = —S Eg, and we have the potential 
function 
0 for s <0 
Vo(s)= 4 —sEo for0<s<S. 
—SEg fors>S 


The effects at s = 0 and s = S are merely those of hard edge fringe fields as 
discussed before. It is worthwhile to point out that if V > 0, which entails 
that Eo < 0, a particle of positive charge has lower energy at the second plate, 
and thus is more susceptible to the transverse electric fields there. From the 
discussion of hard edge fringe fields, we know that the transverse fields in 
both places are of opposite sign but identical magnitude, thus the second 
field, which happens to lead to focusing, has a more significant effect. 

Using eqs. (4.11), we have the transfer matrices for the kicks at s = 0 and 
s = S as follows. At s = 0, we have 


Vo(0) 20, K(0)= Ko, ps(0) = po = V2mKo, (4.14) 
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and at s = S, 
Vo(S) = —S Ep, K(S) = Ko + ZeS Eo, 
ps(S) = ps = V 2m (Ko + ZeS Eo). 


Thus, we obtain the transfer matrices at s = 0 and s = S as 


ios ( ^ 1 ) r Ms eae 1 ) dn) 


where again the abbreviation 


Ep 
2Xe0 


is used. 

In order to study the two-plate lens quantitatively, we need to still determine 
the transfer matrix of the space between the plates. In this region, a charged 
particle experiences uniform acceleration or deceleration without change in 
transverse momentum, i.e., a; = a;. However, the position changes in a similar 
mechanism to that of a drift, but including the effect of the change of the 
kinetic energy. For the region between the electrodes, we have from the z’ 


equation of (4.9), and using ps(s) = \/2m (Ko — ZeVo(s)), 


E E 
V Ki 
Ax =a; —2;= a; | eas =a; | a is 
o Ds(s) o VKo + Zesko 
2 2 p 
= ai v Ko (VKo + ZeSE - VKo) = a; 
Geng.” Orne 0) = Zems 2m 
2po 


o (Ds — po) 


where the following relation is used to simplify the last step, 
p5— pe = 2m (K(S) — Ko) = 2mZeS Eo, (4.16) 


and the abbreviation 
poai 
Ps + Po 
is used. Thus the transfer matrix of the gap between the plates is 


M =( ! ME dU 1 Ed (4.17) 


La =5 


0 1 


We note that if there is no change in the kinetic energy and thus ps = po, this 
matrix agrees with the matrix of a drift with length S. On the other hand, if 
the system is accelerating, we have Ls « S, while for a decelerating system, 
Lg > S, reflecting the behavior expected from a simple geometric analysis. 
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Combining the above matrices, we construct the transfer matrix of the 
electrostatic round lens with two plates as the field described in eq. (4.13). 


Mi Hs Hs em (aus 1) (8 2) (a 1) 


- ( 1c oaLs Ls ) 
~ No [1 - (po/Ps)(1--aLs) 1-eLs(po/pvs)]' 


This is a rather compact representation of the matrix. However, to more 
clearly observe the influence of the defining parameters V and S, we now also 
express the matrix in terms of these quantities. We note that alg can be 
expressed in terms of momenta using eqs. (4.16), (4.12) and (4.14), and we 
obtain 

Eo 2po 1 Ze p -ph 20 . po—ps 


aLg--— -S = --—_ = ! 4.18 
2xeo ps + Po 22Ko 2mZe ps + po 2po ee) 


Using this, the elements of M can be organized as 


3po — ps 
2po 


= 30s — 
ps 


(z|zr) =1+aLs = 


2ps 2ps 
30, — 2 
TET. h -Pay ats) = (1 - d _ ao P? Ps 
ps ps  2po 2ps 
= -302 29 Lg, 
Ps 
resulting in 
A (3po — ps)/(2po) Ls ) 

M = à 4.19 
( —3o?Ls(po/ps) (3ps — po)/(2ps) ety) 


In this representation, it is obvious that the (a|x) element is always negative, 
and thus we obtain the important conclusion that the two-plate lens always 
focuses. 

As observed above, the simplest plate lens leading to vanishing electric fields 
at far distance required two plates. However, the potential after the two-plate 
lens differs from the potential before. Thus, in order to achieve identical 
potential before and after the lens, which is often desirable in practice, at 
least three plates are necessary. So we consider a lens that consists of three 
flat electrodes as shown in Fig. 4.9, where the axial electric field E;(s) and 
the potential Vo(s) are given as 


Eo for —-S<s<0 
Es(s)= 4 -Eo for O<s<S, 
0 for ls| > S 
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Vo(s), 
Em 
-S 0 S S 


FIGURE 4.9: Layout (top) and potential profile (bottom) of the electro- 
static three-plate round lens. 


and 
—sEg4-V for -S<s<0 
Vo(s) = sEo+V for O<s<S, 
0 for ls| > S 


and the potential V at the middle plate is given by 
V = —S Eo. 


We observe that we can treat this system as a combination of two of the two- 
plate electrostatic round lenses already discussed. For this purpose, we first 
list the momenta at the plates. At s = +S, we have 


Vo(S)-20, K(ES)- Ko, p.(XS$)-po- V/2mKo, 


and at s — 0, we have 


V) =V ccSESN K(0)= Ko + ZeSEp, 
ps(0) = pm = V 2m (Ko + ZeS Eo). 


For purpose of clarification we note that if V > 0 as shown in Fig. 4.9, we 
have that Eg < 0 and pm < po. 

The first half of this lens is simply the two-plate lens discussed above. Thus 
from eq. (4.19), the transfer matrix for the left half of the system is 


^. _ (Bpo — Pm)/(2P0) Ls 
iE ( —3o?Ls(po/Ppm) (3pm ua) 
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where 
Eo Lg-8. ZEN 
2Xe0 Pm + po 

The second half consists of the entrance kick with the momentary momen- 
tum pm, the gap of length S connecting the two momentum states pm and 
po, and the exit kick with the momentary momentum po, with the constant 
electric field — Eo in the gap. The transfer matrices of these components turn 
out to be Ms, M, and Mo from eqs. (4.15) and (4.17), respectively, where 
here we use Dm na of ps. So we obtain the transfer matrix for the right 


half of the system as 


in = Nig: Ms Nts = (1 oe: a) atem i) 


QS 


e: ( 1 — aLs(po/pm) Ls ) 
a [1 — (po/pa)(1-- aLs) 1+aLs 
c. — po)/ (2pm) Ls ) 

—3o?Ls(po/pm) (3po — Pm)/(2po0) / ` 


We observe much similarity between Mr, and Mp; the off- diagonal elements 
are the same, and the diagonal elements are merely switched. The (a|x) 
element of both Mr, and Mg is always negative, thus both of the lens halves 
always focus. 

Finally, the calculation of the transfer matrix of the whole lens can be 
conducted, where the similarity of M and M R helps the arithmetic in the 
process. We obtain 


"o xr. wr. _ { (Glan (xla)n Y ( (xlx)n (vla)r 
PE AR vi = ( au ) 
= cee + (v|a)r (a|lx)r 2(a|a) r (x|a)z ) 

(x|x) r (a|x) (z|x) p (a|a) n + (xa) (a|x) r 
-«( (3/2)(po — Pm)? / (Pops) Ls(3pm — Po)/Pm ) 
—3o?Ls(3po — Pm)/Pm 1 — (3/2)(po — p)? / (pops) 


where eq. (4.18) is used to simplify the result. 

We now consider the focusing property of the three-plate lens. As before, 
Ls > 0, but we now have the factor 3p9 — pm determining the eventual sign 
of (a|x). In case the magnitude of the voltage V is small compared to the 
kinetic energy of the particle, the center momentum pm is similar to po, and 
so 3po — Pm > 0. On the other hand, in cases of extreme voltage V leading to 
large acceleration and large center momentum pm, it is conceivable to achieve 
Pm > 3po. In this extreme case, the three-plate lens actually defocuses. 


4.4.2 The Magnetic Round Lens 


Magnetic round lenses are arrangements of wires wound equidistant to the 
reference orbit, either in long arrangements similar to textbook-like solenoids 
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with a large region of nearly constant axial fields, or in short or “thin” ar- 
rangements where the field on axis rises, reaches a peak, and then falls off 
again. Thin magnetic lenses are the main staple used for focusing in electron 
microscopes, while long magnetic lenses are used in various particle accelera- 
tors for guiding the beam, including applications in muon ionization cooling. 
We have the magnetic field, linear in x and y, given as 


Ba = EV§ (e)a, By-2W)s B= -Vi (8), 
which is derived from the corresponding magnetic scalar potential 
V = Vo (8) - EVY (8) (2? +97). 
Different from the electrostatic potential in the case of the electrostatic round 
lens, the magnetic potential does not enter the equations of motion directly. 


So we simplify notation by not having it appear in the equations of motion, 
and rather express the fields in terms of the axial center field Bs(s) as 


Applying this to eqs. (4.8), and linearizing in the similar process for eqs. 
(4.2), we obtain 


j r Bs 1B 
r =a, @ =+ b+- Y, 
XmO0 2 XmO0 
DB; 1B! 1 
y = b, b' = — a — 2 E T, U = Gay 5 = 0. (4.20) 
Xmo Xmo + No 


It is immediately apparent that the motion of the two planes are coupled, 
which will lead to interesting properties. The longitudinal motion to first 
order is the same with that of a drift, thus 
lp = Dôi + li = — aii + li, Of = ĝi 

(2+ 70)? 
with length L. 

To study the transversal motion, we first express the equations of motion 
(4.20) in vector notation. Using 


(5). O (89) 


B A 
s Îz. (4.21) 


we have 
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For a particle moving parallel to the s-axis without transversal momentum, 
i.e., €; = 0, the second term in the c' equation, B! /(2Xmo): FZ, acts to produce 
a nonzero transversal momentum. Typically, the situation of B! 4 0 happens 
in the fringe field regions near the entrance and the exit, and it is the main 
reason why particles entering parallel to the s-axis will follow a transversely 
rotating motion inside the magnetic round lens. 

In the following, we will employ a dual approach in understanding the 
motion of particles in the magnetic round lens. The quantitative approach 
will be based on studying the equations of motion and solving them for certain 
specific cases. But equally important is the qualitative understanding of where 
the focusing effects of a magnetic round lens really come from. 


4.4.2.1 Hard Edge Fringe Fields 


We first discuss the fringe field effects appearing in an abrupt transition into 
a region of axial field from a field-free region. In practice this situation arises 
at the edge of a solenoid of very small radial aperture. We use the impulsive 
kick approximation in a similar manner as applied to the dipole edge focusing 
in Section 4.3.2. When entering into the solenoid of the constant field strength 
Bo at the idealized hard edge at s = 0, B, and B’, are expressed in terms of 
the Heaviside step function H and the Dirac delta function 6 defined in eqs. 
(4.5) via 

B,(s) = BoH(s), B! (s) = Boó(s), 


where the entrance lies at s — 0. Applying them to the equations of motion 
(4.21), we obtain 


04- z 5 

BoH(s) > DBoó(s) ; Bo + 
a= a+ | [PEO jg, , SOO, dus ee pea ĴZ, 
= Xm 2Xmo 2Xmo 


while Z is unaffected. Altogether we obtain the transformation relations be- 
tween the initial conditions z; and c; and the final conditions Z; and c; for 
such an idealized thin edge as 


uda: (4.22) 
2Xmo 


> 


Zp—EQ Cf = Qt 


At the exit the same effect happens, but with an opposite sign of B}, because 
we now have 


Bs(s) = BoH(—8), Bi(8) = —Boó(s) 
where the exit is at 5 — 0, so that we obtain 


Bo + 
Zp2EZ, @ =G- JZ. 4.23 
f f Pico (4.23) 


Overall we have kick effects that similar to other fringe field cases leave the 
position unaffected, but change the directions by an amount proportional to 
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FIGURE 4.10: Transversal motion of particles entering at positions x; 
initially parallel to the reference axis in the magnetic solenoid. After crossing 
the fringe fields, the particles execute rotations with radii r; equaling half of 
their entrance position. 


position. However, very different from the case of the electrostatic round lens, 
this change in direction in itself is neither focusing nor defocusing but rather 
happens in azimuthal direction, perpendicular to the radial direction. 


4.4.2.2 The Mechanism of Focusing 


Since the action of fringe fields in the transition into axial magnetic fields 
leads to azimuthal kicks, one is led to wonder where any focusing effects may 
arise from. We consider an incoming beam where all trajectories are paral- 
lel to the reference axis, i.e., c; = 0. Furthermore, because of the rotational 
symmetry, it is sufficient to limit our attention to a particle starting on the hor- 
izontal axis, which means Zr = Z; = (x;,0) in eqs. (4.22). After having moved 
through the fringe field, the particle has now picked up a velocity component 
in the negative y direction via @ = (Bo/2Xmo)J% = (Bo/2Xmo)(0, — x1). 

We assume that the particle travels for a while in the constant magnetic 
field Bo. In this field, it performs a rotation due to the nonzero transversal 
velocity picked up in the fringe field, which is v; = (Bo/2x,,0)x;vo. The radius 
of this rotational motion is given by 

ymv ym Bo 1 
= = Tito = qi, 


"= ZeBo ZeBo 2Xmo 2 


where Xmo = po/Ze = ymvo/Ze is used. So the rotation radius is exactly half 
of the radial entrance position of the particle. As it turns out, the factor of 
1/2, which can be traced back to the fact that the radial derivative OB, /0x = 
—1/2- B1 (s) is only half of that in the axis direction, will be the key mechanism 
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to focusing. 

We illustrate the resulting motion in Fig. 4.10, which shows the resulting 
transversal orbits for six particles on both sides of the origin that were initially 
moving parallel to the axis of the solenoid without any transverse motion. 
Each of these particles performs a circular motion with a radius that equals 
half of its entrance position. In particular this entails that the particles do not 
have concentric orbits like in a cyclotron, but rather that after a half period 
of their oscillations, all these particles will pass through the exact center of 
the device. 

This observation now leads to focusing; on their path from their initial 
position x; to the position x = 0 after a half period, each particle loses distance 
to the axis and is thus focused. 

We further observe that all particles rotate with the same angular frequency 
given by 


This entails that all initially parallel particles reach the origin at the same 
time, and also that the relative loss of distance to the axis is the same. So the 
speed with which they move towards the origin is proportional to their initial 
position, and it is a hallmark of thin lens focusing. 

'The picture suggests another interesting feature; initially axis-parallel par- 
ticles that are on a common line upon entering remain on a common line, 
shown dashed in Fig. 4.10, in their further motion through the region of 
constant magnetic field. Elementary geometry shows that this is actually the 
case: by connecting any particle in Fig. 4.10 (dot) and the corresponding 
center of its orbit (cross), we can see immediately that the angle between the 
dashed line and the z-axis is always half of that between the radius and the 
x-axis. We further observe that when the particles reach the origin and have 
performed a half revolution around their orbits, the dashed line will coincide 
with the vertical axis, and thus will have performed a quarter revolution; in 
fact elementary geometry shows that the dashed line performs a rotation with 
one half of the rotational frequency of particles. 


4.4.2.3 The Rotating Coordinate System 


'These observations of the idealized case now motivate the treatment of the 
general case, in which fields do not jump abruptly but rather gently depend 
on s. In this case the appearing momentary rotation frequencies of both the 
particles as well as a possible suitable rotating coordinate system are not 
constant but will change with the position s. Thus, we attempt the Ansatz of 
introducing new variables Z that describe the motion in a rotating coordinate 
system via 

2(s) = R(0(s)) - Z(s), (4.24) 


and we will arrive at the expected result at eqs. (4.28). Here, R(0) is a 
rotation matrix, and for the further discussion, the following matrix properties 
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are useful: 
Gi gt ane E a sd 1. 37 


cosÜ —sin@ 
e - ( 


sing sing) ese P= sint and R-1(6) = R(—0), 


jm) = Re) 3- (m aa =sin@-[+cos6- J, 
dR(0) _ sinÜ cosÓX sg dR() as ae 
"do - => me o) = —JR(0), and T = —0 JR(0). 


(4.25) 


Similar to the definitions of Z and c, we denote the new rotating variables Z 


and C as 
3 X A A 
z=(y), e-(i) 


and define C as the first derivative of Z with respect to s 


Guys zu ae 


Note that this does not automatically entail ¢= R-C. Rather, c differs from 
R-C as we will see now. By computing the derivative of eq. (4.24), we express 
C(s) in terms of R, Z and C. 


és) = Z'(s) = a) Z(s) + R(8)Z'(s) = —9'JR(0)Z(s) + R(6)C(s). (4.26) 


In turn, this allows us to express C (s) in terms of Z and c. 


G(s) = R-!(0). las) +O JR(0)Z(s)| = R(—0)a(s) + 8 FR(—0)2(s), 


where eqs. (4.24) and (4.25) are used. Next, we calculate the derivative of 
C(s), expressed in terms of Z and Z. 


4-0" J R(—0)z(s) + p AY as) + 6 JR(—0)z'(s) 
T (oj - e?) R(—0)2(s) + 20 FR(—6)e(s) + R(—0)e" (s), 


where the relations (4.25) and c — Z' are used. 
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We now insert the equation of motion (4.21) for @’ above, and express C"(s) 
in terms of Z and C using eqs. (4.24) and (4.26). 


Ó'(s) = ("3 = wa R(-0)z(s) + 20' JR(-0)&(s) 


+R Fas) a2 Jats] 


Xm0 2Xmo0 
- (e zB) 5-08] awe (28 E 
= + — —0)Z(s) + | 20 + — | JR(—0)č(s) 
2Xm0 mo 


B! ^ ^ B, a| > 
E [IG TI ) J — 0? [ +20 G + ) i Z(s) 
2Xmo 2X%mo 
B za 
+2( ++) JC(s), 
2Xm0 ( ) 
where the relations (4.25) are used. 
The resulting equation of motion for C looks rather complicated, but a 
closer inspection shows the same factor appearing repeatedly. Indeed, if we 


demand 
Bs 


0 t Z—— =0, 
2x mo 
which is equivalent to 
* B&(s) ,. 
(s) = -f d5, 4.27 
(s) Rowen (4.27) 


the equation for C’(s) greatly simplifies to 
C'(s) = —0? Z(s). 


This means that the motions of the two coordinates of the vectors Z and 
C fully decouple, and we hence have the following simple set of first order 
differential equations to describe the motion of Z(s) : 


E 


Z'(s)=C(s), OC" (s) = -0?Z(s) = — (28) Zils). (4.28) 


So the motion in the rotating system is like a harmonic oscillator with 
varying strength, which is given by the square of the angular frequency 6’ 
of the rotation of the coordinate system, which itself is proportional to the 
longitudinal field B,(s). It is a quite remarkable and yet simple result that 
fully describes the linearized motion in the magnetic round lens. 

We note that, as a consequence, a short magnetic solenoid is focusing, 
and the focusing power is proportional to 0? œ 1/x2,9 e 1/p%, whereas that 
of a magnetic quadrupole is proportional to 1/ymo c 1/po. Therefore the 
advantage of using magnetic quadrupoles compared to magnetic round lenses 
becomes more pronounced for beams of higher momentum. 


104 An Introduction to Beam Physics 


4.4.2.4 The Solenoid with Hard Edge Fringe Fields 


We now study an idealized long solenoid with vanishing field outside and 
with a constant interior field of 


B.(s)= Bo, By = By =0. 


We begin the discussion with the observation that the form of the equation 
of motion (4.28) entails that the quantities Z and C vary continuously even 
when passing through a hard edge fringe field. In fact, different from the 
situation for Z and c, there are no delta functions appearing which led to the 
discontinuities in eqs. (4.22) and (4.23). So in both the entrance and the exit 
of the hard edge fringe field, instead of eqs. (4.22) and (4.23), we simply have 


Z,=2;,, Õ=Ğ. (4.29) 
Assuming that the beginning edge of the solenoid is located at s — 0, we 
have that 
Bo , do Bo 


(s) =— 9 = — = tant). 
(s) 2207 35 F (constant) 


The angular frequency 0’ is now constant on the inside, and we use the ab- 
breviation 


B 
w=- (constant). 
2Xm0 
Thus we obtain a simple harmonic oscillator solution for Z. Using the initial 
conditions Zo and Co, we have 


E | 4 
Z(s) = cos(ws)Zo + F sin(ws)Co, 
C(s) = —wsin(ws)Zo + cos(ws)Co. (4.30) 


Now this solution has to be expressed in terms of the original coordinates 7 
and c. 

We begin by observing that outside the beginning of the solenoid, we simply 
have 

Z-Z G-&a&. (4.31) 

Next, because of eqs. (4.29), even just after entering the solenoid, we have 
Z; Lx 6 = cj. In the solenoid itself, the quantities Z and C change accord- 
ing to eqs. (4.30) until the end of the solenoid is reached, where they have 
the values Z(L) and C(L). When exiting the solenoid, Z and C again remain 
unchanged because of eqs. (4.29). In the outside region, there is no field left, 
so in eqs. (4.24) and (4.26) we have 0' — 0, thus the transformation equations 
simplify to 


Z(L) = R(oL)-Z(L, | &L)- R(wL)- C(L). (4.32) 
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We now combine the various matrices and use y = wL. We obtain the result- 
ing 4 x 4 transfer matrix Movt-int-in of the solenoid of length L for (x,a, y, b), 
representing first entering into the solenoid, then passing through its interior, 
and then exiting out of the solenoid, as a combination of eqs. (4.31), (4.30) 
and (4.32): 


Mort int-in — f). Maole) 


cos? o sinycosy/w  —sinecoso — —sin? g/w 
| | ^wsinecoso cos? o wsin? o — sin Y cos o (4.33) 
| singcoso sin? p/w cos? i sing coso/w [| 
—wsin?p  sinycosp  —wsingcosp cos? y 
where we used the 4 x 4 matrices for the harmonic oscillator solution 
cosy  sing/w 0 0 
^ _ | -wsiny cosy 0 0 
Mno(e) = 0 0 cosy  sing/w |’ (4.34) 
0 0 —wsiny cosy 
and for the rotation 
cosy 0  -—sineg 0 
^ 0 cosy 0 — sin Y 
R(y) = (4.35) 


sin 0 COS (p 0 
0 sin Y 0 COS (p 


We observe that these matrices commute, i.e., Myo: Ê = R- Myo, which 
helps simplify the matrix arithmetic here and below. 

We note that the matrix M?"'-i?-i^ has unit determinant since both the 
harmonic oscillator solution matrix Myo and the subsequent rotation R do. 
It is also easy to show that the longitudinal angular momentum is conserved, 
because 


^ > 


Zj x & = AL) x AL) = (ÊWL) - ZU) x (RWL)- E(L)) = ZU) x Č(L) 


Z 


Z(L 
= - + Sing) x (—wsin eZ; + cos e) - Z = Ži x Ĝi 
Zi X 


& 
Cj. 


Now we may wonder what happens if we study the motion not only from 
the field free regions before to the field free region after the solenoid. For 
this purpose, we first remind ourselves of the 4 x 4 transfer matrices of the 
entrance and the exit edges, which according to eqs. (4.22) and (4.23) are 


10 00 1000 
^in | 0 1 —w O ^ dE 01-00 
ees 00 10]" M 0.01 0 
w 0 01 —w 0 0 1 
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We observe (M"*)-! = MY”, and both of the matrices have determinant 1. 

We discussed the behavior of the edges in Section 4.4.2.2, and we can now 
determine the change in the longitudinal angular momentum across the edges. 
At the entrance edge, using eqs. (4.22), 


Zp x é& = Zi x (&—wJZ) = % x & — wk, x (JH) 


LEX a-o) x (ajz x G +w (£? +y?) Es = Zi X Ci + wr? ës, 

a 
where €; is the longitudinal unit vector. Thus the amount of angular momen- 
tum generated is wr? at the entrance and —wr? at the exit. 

The transfer matrices for various situations below can be obtained by com- 
bining M"t-int-in. yyin, out and their inverse matrices. The transfer matrix 
of the interior part is obtained by removing the entrance and the exit matrices 
from Jfout-int-in as (Agout)—1 . [out-int-in ` (ME, and we obtain 


1 singcosy/w 0  —sin?o/w 
yet = 0 cos(2y) 0  -—sin(29) 
-| 0  sin?g/w 1 sinycosy/w 
0 sin(2y) 0 cos(2y) 


The determinant is again 1. Interestingly, the longitudinal angular momentum 
is not conserved in general when going through the interior, and the amount 
of the change is vr? — r7). 

When the particles start inside the solenoid and then exit, which typically 
happens when the particles are “born” inside the solenoid, we obtain Afout-int 
by removing the entrance matrix from Mo"t-int-in jg, jpoutint-in , ( frin)-1. 
or equivalently adding the exit matrix to Aint, i.e., Mt. MiP, as 


1 singcosy/w 0  —sin? y/w 
jj out-int E 0 cos? p w —sin (p COs q 
B 0 sin?y/w 1 singcosy/w 


—uw sinycospy 0 cos? o 


which again has determinant 1. As discussed above, while M9"t-int-i» conserves 
the longitudinal angular momentum, M? does not, thus the longitudinal an- 
gular momentum changes in the process. 

Finally, when the particles enter into the solenoid and remain inside, we 
have Mi of the form 


cos?  sinycosy/w —sinycosy — sin? y/w 
yritin | sin(2p)  cos(2%) —wcos(2y) | —sin(29) 
~ | sinycosy — sin? p/w cos? i sin y cos p/w 


w cos(2y) sin(29) —w sin(2y) cos(2y) 
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as a result of removing the exit matrix from Mo"tintin jg, (Mpout)-1 . 
Mour-int-in Again the determinant is 1, and the longitudinal angular momen- 
tum changes. 

We conclude our discussion with a more detailed comparison of the treat- 
ment of the solenoid with the results of the electrostatic round lenses. We 
note the similarity between the solenoid including entrance and exit fringe 
fields and the two-plate electrostatic lens. Indeed, the magnetic scalar poten- 
tial, expressed in terms of the angle 0 in eq. (4.27), has risen steadily inside 
the solenoid and is reaching a plateau outside of the solenoid, just as the 
electrostatic potential rose steadily between the plates of the two-plate lens. 

It is thus illuminating to study the case of two solenoids of equal length and 
opposite strength, which will lead to a vanishing magnetic potential change 
at the end of the second solenoid, and conceptually corresponds to the elec- 
trostatic three-plate lens. In that case, the fact that the potential returns to 
its original constant value entailed that the particle's energies are the same 
as before. Here by virtue of eq. (4.27), we obtain that the net rotation of the 
system is zero. 

To be quantitative, we can obtain the transfer matrix of this system by com- 
bining those of two opposite solenoids with hard edges. We remind ourselves 
that the solenoid transfer matrix in eq. (4.33) was obtained as the product 


M E RCo) . Mnuo() 


of the matrices R(y) and Myo(v) defined in eqs. (4.35) and (4.34). So the 
transfer matrix of a solenoid of opposite field is simply given by 


M; = R(-e): Muo(-¢). 


Now we can use the fact that R(y) and Mzo(y) commute as noted above, 
and obtain for the combined matrix 


M = M5 - My = R(-¢)- Muo(-¢)- RY): Muo(¢) 
= R(-y) - R(o) - Mno(-9): Mnuo(e) = Muo(2¢), 
where we have used that R(y)~! = R(—y) and Myo(—¢v) = Myo(y). So 
indeed any rotation is removed, the x and y motion are fully decoupled, and 


correspond to a simple harmonic oscillator that is equal to that of a solenoid 
of twice the original length. 


4.5 *Aberration Formulas 


In the previous sections we have discussed in detail the linearization of the 
motion in particle optical coordinates, and the resulting transfer matrices for 
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common particle optical elements. However, in general the motion is not 
linear, and in many situations it is important to take into account various 
contributions of the nonlinear effects. 

Unfortunately, as straightforward as the determination of linear transfer 
matrices is in many cases, the determination of nonlinear terms, especially 
those of higher order, becomes exceedingly more difficult using paper and 
pencil methods. In the following we will describe a general method that 
in principle allows the recursive determination of aberrations of higher and 
higher orders, but which in practice quickly succumbs to a rapid increase in 
complexity for higher orders. 

Let us assume we are given the system described by the following ordinary 
differential equation (ODE) 


z^ F (Fs), 


which satisfies fl (0, s) = 0. We perform a Taylor expansion of the right hand 
side. Because the system is origin preserving, the first contribution is linear, 
and altogether we have 


oo 
—F = M(s)-7+ MN; s), 

j=2 
where the N; are polynomials of exact order j, the coefficients of which may 
depend on s. 


The first step in obtaining a perturbative solution of the system is a lin- 
earization as in the previous sections. We have 
d A 


477 M(s) r. 


For this system, we determine a system of n independent solutions hi (s), 
k — 1,...,n, that satisfy the initial condition 


I (0) = (0,0,...,. 1 ,...0,0)7. 
kth 


We define the matrix 


and observe that the general solution of the linearized problem with initial 
condition 7; is then given by 


F(s) = L(s) fi. 


In practice, the determination of L may be possible in closed form, depend- 
ing on the structure of M, or may have to rely on numerical integration. For 
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the special case that M is piecewise constant, then for every such piece, 
one can try the ansatz lk = Uy - exp(wys), which leads to the condition 


wj Uy exp(wis) = M. exp(wys), 


an eigenvector problem. If M has n distinct eigenvalues, we are done, and 
depending on whether wj is real or complex, the solutions can also be ex- 
pressed in terms of sin, cos or sinh, cosh. In case of multiple eigenvalues, 
often solutions of the form s - sin, etc., can be found. 

The next step consists of an expansion of r(s) in a Taylor polynomial 


F(s) = L(s) ri 9, Ry (5), 


j=2 


where R; denotes a polynomial of exact order 7 in the initial conditions, the 
coefficients of which may depend on s. We insert this expansion into the ODE 
and obtain 


where Q;’s (j > 2) are polynomials of exact order j in 7, which result from 
inserting f into N j 8. This insertion leaves no linear or constant parts, which 
is due to the fact that the ODE is origin preserving. This will prove crucial 
later in the algorithm for the solution. 
We now sort the result by order. The linear part has the form 
d A 


qi hls) = M(s) £9), (4.36) 


and the higher order parts, j > 2, assume the form 


d = re = eee 
3s Fi (570 = M(s) FR; (sri) + Qs, L, Re), (4.37) 
where Q; contains only R, with k < j. So for j — 2, 3,..., we obtain a 


triangular system of ODEs. It can be solved iteratively in an order-by- 
order manner, and then each of the differential equations for R; contains 
only lower order terms Hy that are already known. In this way, the ODEs 
decouple and become inhomogeneous. 

Initially at s = 0, we have the initial condition r(0) = 7;, and 


L(0-—1i,  R;(07)—0 for all j =2,3,.... 
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In order to solve the inhomogeneous eq. (4.37) of order j, we first deter- 
mine the homogeneous solution, and then perform a so-called variation of 
parameters. The homogeneous solution is exactly the same form as for the 
linearized part. lo obtain the inhomogeneous solution, we make the ansatz 
R;(s) = L(s) - T(s). Then 


ig (Zio): T (s) + L(s)- 


d T (s). 


ds 
Using eq. (4.36), the first term in the right hand side is 


(9) Tt) = i) £9) T6) = 1) Ru) 


* d= Ge es 
is) SF (s) = Gils, È, Ra), 
that is E 
To= [ EGGE t, Ros, (4.38) 
0 


where the choice of the lower integration boundary as 0 ensures that T(0) 
= 0, which agrees with the initial condition R; (0) = 0. Altogether we have 


R; (s) = L(s) - [ £-1(3)Q,(3, È, Re) ds. 


The integral is often referred to as the aberration integral, and the inte- 
grand Ec 19; as the driving term. The complete solution then is obtained 
as 


P6) = Li) «YR (o) = LG e Y H): f DOGG, L Rajas. 


So, once the linear solution is known, everything else just boils down to 
quadratures. If within a piece in which it is constant, M (s) is diagonalizable, 
the linear solutions can be written as combinations of sin, cos, sinh, cosh and 
s. In other important cases where M(s) is singular, often a complete set of 
linear solutions that are polynomials in s can be obtained. 

In both of these cases, the insertion into the polynomials R; (s) leads to 
terms that are polynomials in sin, cos, sinh, cosh and s. By expressing such 
functions in terms of exponentials times powers of s, one can show that the 
result of any integration can again be expressed as a polynomial of sin, cos, 
sinh, cosh and s. 

For practical cases, it is worthwhile to discuss the complexity of the proce- 
dure. With each new order, the expansion of the ODE becomes more compli- 
cated; then all previous orders have to be inserted, multiplied with the linear 
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inverses, and integrated, resulting in substantially more terms than for the 
previous order. Altogether, the effort increases extremely dramatically 
with the order being considered, and for typical systems, it is practical only 
to orders around five. 

Computer codes that use the above procedure usually contain a library of 
procedures that compute the aberrations for each particle optical element of 
interest. The aberrations of combined systems is then determined from those 
of the pieces with the help of a composition procedure. The Differential 
Algebraic (DA) approach, as described in Chapter 5, allows the computation 
of aberrations to any order in an elegant way without the need of explicit 
formulas for aberrations. 

To illustrate the method of computation of aberrations with a simple ex- 
ample, let us consider the differential equations 


1 1 2 
r —a, a = —r + kztî, 


which corresponds to the horizontal motion in a quadrupole with a superim- 
posed sextupole. We first perform the linearization to obtain 


JA 9-9 C) 


The linear solution then is 


(2- (EB ey (2) aa m 


'The next step is the expansion of the ODE, which is already done in the 
given differential equations. We then insert the solution expanded up to the 
second order in the initial conditions x; and a; 


z(s) — (x|z)z; + (zlaja; + (z|zx)z? + (x|za)v;a; + (z|aa)a2, 


a(s) = (a|x)z; + (a|a)a; + (a|zz)z? + (a|xa)z;a; + (alaa)a?, 


into the ODE, and obtain 


(x|z)' xi + (xla) aj + (zler) x? + (xz) xia; + (z|aa)' a 


and 


(ajx) xi + (a|a)' a; + (alaa)'x? + (alza) z;aj + (a|aa)'a2 
=— [(x|x) a; + (zlaja; + (x|xx)z? + (r|ra)z;a; + (|aa)a;] 


+k [(ala)?a? + 2(e|x)(a|a)aia; + (z|a)?a2 4- ---] , 


where we can ignore the higher order terms, since we are interested only in 
order two. 
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The second order equations then read 


(c|xa) z2-- (x|xa) z;a;--(m|aa)'a2— ^ (a|xa)a2 -- (a|xa)o;a; 4- (a|aa)a?, 
(a|zz)'z? -- (a|za) z;a;-- (a|aa)' a2 = —[(a]xx)x? + (z|za)z;a;4- (z|aa)a?] 
+ k[(2|x)?27 -2c|z)(z|a)zia;- (x|a)^a;]. 


where the last line proportional to k is the inhomogeneous part Qs. and using 
eq. (4.39), 


G= (=)= " 
2 Qa k [cos? s - £? + 2 cos ssin s - xia; + sin? s - a2] ` 


We make the ansatz R2(s) = L(s)T(s) : 


(ee + (x|xa)zja; + eed i | COS 8 2) T (s). 


(a|zz)z? + (a|za)z;a; + (a|aa)a2 —sins coss 


so. (Tr\ fll; n [lf(cos8 —sins 0 $ 
ao o) L Gas = | i i Los es 


So 


S 
nc (— sin $ - Qo,) ds 
0 
Ss 
=i f (— cos? 5sin 5 - x? — 2cos 8sin? §- aja; — sin? 3 - a7) ds 
0 


2 1 2 
=+} (cos? s — 1)22 — gsin® s- vias + (coss ~ geo - 3) 4l. 


S 
nel cos 8 : Q2ad5 
0 
=if (cos? s - £? + 2 cos? 5sin 8 xja; + cos 8sin? 5 - a7) ds 
0 


1 2 1 
=k (sins - jus) r? — z (cos? s — 1) zia; + Esin? s-a? . 


Then we obtain Rə (s) = Ê (s)-T (s) , which yields the second order elements 
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of the transfer map 


NES ia 
— sin" Ss — —coss = 
3 3 3 


Qo 
o 
[2l 
va] 
l 


2 PN. 
zcoss-ct- 
3 3J’ 
; D ) 

peu S ; 


2 


si z cos? s + 2 
Ms = S z Coss 
3 3 i 
2 2 
(alaa) = k (-5 sin s cos s + pe) . 


In a similar fashion, but with much more effort, one can also compute the 
third and higher order terms. 


Chapter 5 


Computation and Properties of 
Maps 


Up to now, the equations of motion have only been solved perturbatively to 
the first order. Yet the knowledge of the nonlinear part of the solution is 
also needed to determine precisely the performance of a device. "Traditionally, 
this is done analytically using perturbation theory (see Section 4.5 for more 
details). In the past, a tremendous amount of knowledge about aberrations 
for various kinds of devices has been accumulated. Yet this approach is far 
from being systematic, making the simulation prone to errors, and it is diffi- 
cult to obtain an accurate solution for a realistic device where no analytical 
solution exists. In this chapter, a modern method, the Differential Algebraic 
(DA) technique, for computing the transfer map to arbitrary order, will be 
described. But before we embark on this task, we will first classify aberrations 
that can appear in transfer maps in terms of their symmetries. 


5.1 Aberrations and Symmetries 


Recall that the transfer map of an optical system relates final coordinates 
to initial coordinates via 


Zr = M), 


where Z = (z,a, y, 5,1, ð). In the previous chapters, we were concerned mostly 
with the linearized part of the map, which describes the major part of the 
motion and which can be described by transfer matrices. The matrix elements 
were denoted as (x, a), etc. 


In order to study the effects of the motion very precisely, it is necessary 
to also consider higher order or nonlinear effects. For this purpose we Taylor 
expand the map (in a rigorous sense the question whether the map can actually 
be Taylor expanded is rather nontrivial, but we ignore this here), and use 
names for the coefficients similar to what we had for the linear motion. We 
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write 


a 5 (x qt? ate y" pe [^ 5°) qt? att yt» bie [is 
afs = Xo (a git q's y pelt 9H) git q's y" pe [5 $15 
yf 5 (y git q's y pee [% 5°) xt? atty’ pie [gis 
by = 5 (ba q's y" pe (i ó5) "n att yi ptt [t os 
lg = 5 (1 git atty’ pep git q's y" pe [% 8, 
Of = » (ó git a^ yv p [à ó5) git q's y" pe [% $15 


where the sums go over all six-tuples (iz, ia, ty, i5, ii, i5); for convenience, they 
are usually sorted by total order. The Taylor coefficients belonging to terms of 
orders 2 or higher are usually called aberrations, as they describe corrections 
to the linear part of the map that are usually small if the phase space variables 
are small. 


In most cases, the freedom of the aberration coefficients is severely restricted 
by the presence of a variety of symmetries. First, in many cases the motion 
of one of the variables does not depend on the values of some other variables. 
For example, if the motion is time independent, we have 


(Z;|z'*af«y'»p*[595) 20 if 40, 


where j = 1,...,6 and Z; is defined in eq. (2.2). Furthermore, in this case 
we know that the kinetic plus potential energy of the particle is conserved, 
and we have that 


(ó|a atty» bl) =0 except (6/6) = 1. 


5.1.1 Horizontal Midplane Symmetry 


'This is perhaps the most important symmetry in beam physics, as it affects 
almost all devices: bending elements, quadrupoles, sextupoles, higher order 
multipoles, cyclotrons and all the combinations of them. It requires that the 
motion of charged particles is always symmetric around the midplane (the z-z 
plane), which is illustrated in Fig. 5.1. 


In a system with midplane symmetry, two particles that are symmetric 
about the midplane at the beginning stay symmetric throughout the system. 
Suppose that a particle is launched at (£i, Yi, di, ai, bi, ti). After the map M 
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FIGURE 5.1: Trajectories of particles in a system with horizontal mid- 
plane symmetry. 
is applied, its coordinates are 


E Ti, Qi, Yi, bi, li, Ôi , 


( ) 
alti i, Vi, bi, Li, Ôi), 
( 


: 


: 


yNTi i, yi, bi, li, Ôi , 


: 


ixi, Qi, Ui, bi, bis ôi 


: 


? 


MU ARE E E 
: : 


) 
(Xi, Qi, Yi, bi, li, Ôi), 
) 
). 


a Gi, Yi, bi, li, Ôi 
Under the presence of midplane symmetry, a particle that starts at 
(his TY» di, Qi, —bi, ti) 


must end at 
(zr, —yf dp, af, —bg, ty), 


which entails that 


Lf = ma(zi, ai, —yi, —bi, li, i), 
af = ma(zi, ai, —yi, —bi, li, ôi), 
—yj = My (zi, ai, —yi, —bi, li, 0i), 
—by = my(zi, ai, —yi, —bi, li, ôi), 

lp = Mu (Ti, ai, —yi, —bi, li, ôi) 


ôf = Ms (Ti, Gi, —li; —bi,l;, ôi). (5.1) 


ds 


, 


Thus flipping the signs of y;, b; simultaneously flips the signs of yr, by, but 
leaves zf, ap, lf, Of intact. Flipping the sign of y;, b; simultaneously produces 
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a sign of (—1)'"*^ in each monomial. So i, + ij must be odd for yp, by and 


iy +% must be even for all others. So we obtain 
(x xte ate ytu po [a 8è) 0 for iy +ib odd, 
(a xte ate ytu pio [8 5°) 0 for i,-- iy odd, 
(y xt giz ytu po [8 oe) —0 for iy+%) even, 
(b a ate yy pie [a 5°) 0 
(I| sa eg'v p Pais = 0 
(ôx atey» bel" 6") =0 


for iy +i» even, 


for ty +i, odd, 


for ty +%, odd. 


(rjr) (ma) 0 0 (al) (ad) 
(alz) (ala 0 0 (all) (alo) 


ae 0 0 (wy) (y) 0 0 
0 0 (by) (lb) 0 0 p’ 
(jz) a) | 0 0 D (s) 
(8|y) (la) O (0 (Bll) (őlő) 


which is a form seen earlier for electrostatic and magnetic bending elements. 
Altogether, the symmetry entails that to any given order, roughly half of all 
aberrations vanish. 


5.1.2 Double Midplane Symmetry 


Several devices have a midplane symmetry not only around the horizontal 
plane, but also around a vertical plane. This is the case for all electric cylin- 
drically symmetric devices, as well as quadrupoles, octupoles, and in general 
4k poles. In this case, in addition to the requirements we just had, we obtain 
a second set in which the roles of x, a and y, b are interchanged. In this case 
we obtain 


for iy+% odd or istia even, 


nann 
c 9 8 


II 


for iy+% odd or itg+%q even, 


II 


for iy tip even or iys+ia odd, 


for iy tip even or is+ia odd, 


— 
~ 


Il 


for iy-- ip odd or istia even, 


II 


— 
o 
Il 

ccc cco 


~ 
œr 


for iy tis odd or i,--i, even, 
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and altogether, about three-fourths of all matrix elements vanish. To first 
order, the matrix must have the special form 


(rz) (x) 0 0 
Pub du Eo 

yam yy y 

M=| o o (y (b 
0 0 0 0 (QJ) (d) 
0 0 0 0 (Bi) (3) 


0 0 
0 0 
0 0 
0 0 


which is what we observed in the case of the drift and the electric and magnetic 
quadrupoles. 


5.1.3 Rotational Symmetry 


One special case of the double midplane symmetry that we just discussed 
is the full rotational symmetry that round lenses satisfy. In this case there is 
a symmetry going beyond what double midplane symmetry requires; the map 
has to be invariant under a rotation in the z-y plane. Let the rotation angle 


be $. The linear transformation R(Z) = R- Z is described in terms of the 
matrix 


cos@ 0 sind 0 0 0 

0 cos@ 0 sing 0 0 

-|7 sing 0 cos 0 0 0 

E 0 —sinġ 0 coo 0 0 , 
0 0 0 0 1 0 
0 0 0 0 0 1 
and we must have that the transfer map satisfies 
MoR=RoM. (5.2) 


In the variables we are currently using, the study of the influence of the 
rotation on the map is somewhat cumbersome, and for this purpose it is 
actually better to choose complex coordinates 


z=xz+iy, w=atib, 
as well as their complex conjugates 
Z—mr—iy, W=a-—ib. 


In these complex variables, the map R has the simple diagonal form 


e o 0 00 0 

0 e 0 0 0 0 

o ue 0 0 0 
RE gs gg: e 9 4]? 

Gi 0 0 0 1 0 

0 0 0 0 0 1 
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and its effect in eq. (5.2) is easy to study. It turns out that in the map only 
those terms that have the form 


zf = mojf.(zE,Qww), wr = wi: fwlzz, ww) 


are allowed to remain. In passing we note that this situation is remarkably 
similar to what happens in the theory of normal forms of repetitive motion 
[5]. 

There are two types of rotational symmetry: One is characterized by the 
system being invariant under a rotation of any angle, which we call con- 
tinuous rotational symmetry; the other is characterized by the system being 
invariant under a fixed angle, which we refer to as discrete rotational sym- 
metry. The former is widely seen in light optics where almost all glass lenses 
are rotational invariant and in electron microscopes where solenoids are the 
primary focusing elements. The latter is preserved in quadrupoles and all 
higher multipoles. 

For the analysis of both cases we proceed with the above complex coordi- 
nates. After expressing x, a, y and b in terms of z, Z, w and w, the transfer 
map is transformed into 


KP LU AE E eaaa 
we Qf E iby Fy 23 71» UE V» "13 UNE 


( Ex ) - y ( Cz ) ght gi apis pla th gia. 
2 PR 
MY aiaia YS jijajsjajtja 


Note that besides z and w, also z and w will appear, contrary to the familiar 
Taylor expansion of analytic functions. This is due to the fact that while the 
original map may be Taylor expandable and hence analytic as a real function, 
it is not necessary that the resulting complex function is analytic in the sense 
of complex analysis. 

Given the fact that a rotation by ¢ transforms z to e’?z and w to etw, 
rotational symmetry requires that a rotation in initial coordinates results in 
the same transformation in the final coordinates, ie., zy — ei? ze, wr > 
e'?w;. Inserting this yields 


where 


(à — j2 + j3 — ja — 1) $ = 27n for x,a, y, b terms, 


(jı — ja + Ja — ja) $ = 2nn for t,dterms, 


where n is an integer. 
For continuous rotational symmetry, which means invariance for all ¢, the 
ji (i = 1,2,3,4) should be independent of ¢. Thus we have 


jı — j2 +j3—j4=1 for zz and wy, 


jı— j2 +js—j4=0 for ty and dy, 
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TABLE 5.1: 


Number of aberrations 


Order Ji J2 J3 jA 
1 1 0 0 0 

0 0 1 0 

3 2 1 0 O0 

2 0 0 1 

0 1 2 0 

0 0 2 1 

1 1 1 0 

1 0 1 1 


which eliminate many terms. First, all terms with jı 4- j2 - j3 4- j4 even vanish, 
because ji + j3 and jo + ja always have the same parity, which means that 
jı — j2 ja — ja is also even. This implies that in rotationally symmetric 
systems, all even order geometric aberrations disappear. As a summary, all 
remaining z and w terms up to order 3 are shown in Table 5.1, where the 
order represents the sum of the j;. To illustrate the characteristic of such a 
map, let us derive the linear matrix from the conditions above. First define 


(z|z) z (Cz) 100000; (z|w) = (€z)oo1000; 


(w|z) = (Cw) 100000; (w|w) = (Cw )oo1000- 
The first order map is then given by 


zp + iyy = (2le) Gri + iyi) + (edu) (ai + ibi), 
ap + ibp = (wļ2)(æ: + iyi) + (wlw) (a; + ibi), 


which entails that the linear matrix is 


Rele) -9(e) Relv) —9(elu) 
, [| 9G) mel) Selu). Rei) 
M — | (uz) —S(wle) R(wlw) —(wlw) 63) 
Stea wl) Sal. Sw) 


As an example, we show the second order map of a solenoid, which has rota- 
tional symmetry, but exhibits a coupling between x and y, and a and b. 

Table 5.2 lists the coefficients of the second order map of a solenoid, which 
shows that indeed all second order geometric aberrations vanish, which is a 
consequence of the rotational symmetry. 

Eq. (5.3) also shows that a rotationally invariant system preserves midplane 
symmetry to first order when the first order coefficients are real numbers. In 
fact a simple argument shows that this is true even for higher orders. In 


122 An Introduction to Beam Physics 


TABLE 5.2: The second order map of a solenoid (Exponents in the 
initial variables z, a, y, b, 0, ô) 


Tf af Yf b f l f exponents 
0.999662 -0.408336e-3 -0.186647e-1 0.761884e-5 0 100000 
0.799815 0.999662 -0.149334e-1 -0.186647e-1 0 010000 
0.186647e-1 -0.761884e-5 0.999662 -0.408336e-3 0 001000 
0.149334e-1 0.186647e-1 0.799815 0.999662 0 000100 
0 0 0 0 1.000000 000010 
0 0 0 0 0.163101 000001 
0 0 0 0 -0.112007e-3 200000 
0 0 0 0 -0.876499e-9 110000 
0 0 0 0 -0.219388 020000 
0 0 0 0 -0.102394e-1 011000 
0 0 0 0 -0.112007e-3 002000 
0 0 0 0 0.102394e-1 100100 
0 0 0 0 -0.876499e-9 001100 
0.370284e-3 0.223860e-3 0.102326e-1 -0.836226e-5 0 100001 

-0.438474 0.370284e-3 0.163792e-1 0.102326e-1 0 010001 

-0.102326e-1 0.836226e-5 0.370284e-3 0.223860e-3 0 001001 
0 0 0 0 -0.219388 000200 

-0.163792e-1 -0.102326e-1 -0.438474 0.370284e-3 0 000101 
0 0 0 0 -0.134185 000002 


complex coordinates, eqs. (5.1) are transformed to 


Pg 

Sm 

SX A 
ll 


e ) (Zi, Zi, Wi, Wi, ti, di) 


> (2) = (z) (Zi, Zi, Wi, Wi, ti, di) 


SUR 


PE DS hay he ex 40 Jum s aod o 
J13233343tJd 


gd ZI2 wi wi tt dja ; 


which shows that all coefficients have to be real numbers in order to preserve 
midplane symmetry. 

For discrete rotational symmetry, invariance occurs only when $ = 2r/k, 
where k is an integer. Hence the nonzero terms satisfy 


ji — j2 + j3 — ja — 1 = nk. 


In general, a 2k-pole is invariant under rotation of ¢ = 2r/k. For example, 
for a quadrupole, we have k = 2. Hence the nonzero terms satisfy 


ji — j2 + J3 — j4 = 2n + 1. 


Like round lenses, systems with quadrupole symmetry are also free of even 
order geometric aberrations. The linear map of a quadrupole can be obtained 


Computation and Properties of Maps 123 


from 


Jı — j2 + J3 — j4 = 1, 
which is 


ap +iys = (z|z)(zi + iyi) + (z|w)(ai + ibi) + (z|Z)(@i—tyi) + (|Ñ) (ai — ibi), 
ag + iby = (w|z)(vi + iyi) + (w|w)(a; + ibi) + (w|z) (vi — iyi) + W/W) (ai — ibi). 


Since a quadrupole has midplane symmetry, all the coefficients are real num- 
bers. Thus its linear matrix is 


(z|z) + (z|z) 0 (z|w) + (z|w) 0 
0 z|z) — (2|Z 0 (z|w) — (z|@) 
(w|z) + (w|z) 0 (w|w) + (w|w) 0 
0 (w|z) — (wz) 0 (w|w) — (w|w) 
For other multipoles, we have 
Lad 
jı — j2 + j3 — ja = nk +1 = 1 k= gA us (5.4) 


k+1 


With midplane symmetry, the linear matrix of a 2k-pole is 


TER 
M=| o 0 (Ql (zw) 
0 — 0 (wlz) (wlw) 


(5.5) 


Since the linear matrix of a 2k-pole (k « 3) is just a drift, it satisfies eq. (5.5). 
Eq. (5.4) shows that the geometric aberrations appear only for orders of at 
least k — 1. The fact that multipoles do not have dispersion determines that 
the chromatic aberrations do not appear until order k. This can be easily seen 
from the equations of motion (3.22). Therefore, a 2k-pole is necessarily a drift 
up to order k — 2. 

A lens with rotational symmetry is frequently called a round lens. The 
main examples are magnetic solenoids and electrostatic round lenses. Since 
rotational symmetry is the highest degree of symmetry a lens can have, round 
lenses have the fewest number of aberrations. As a result, they are widely 
used in low energy electron optical devices such as electron microscopes. At 
high energy, round lenses are too weak to be effective. 


5.1.4 Symplectic Symmetry 


Another important symmetry of the motion is due to the fact that the mo- 
tion is indeed obtained as the solution of Hamiltonian differential equations. 
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In this case, one can show that the Jacobian M of the transfer map M, i.e., 
the matrix 


0.M31/021 OM, /022 OM, /02a OM, /0z4 OM, /Ozs OM, /0z6 
0.M»/0zi OM 3/02» OM2/0z3 OM 2/024 OM2/0z5 OM2/0z6 
0.M3a/0zi 0.M 3/022 0.M3/0za OM3/0z4 OMs3/0z5 OMs3/0z6 
OM4/O21 OM4/Oz2 OM4/O23 OMa4/Oz4 OMa/Ozs OMa/dz6 |’ 
OM3/0z1 O.Ms/Ozo 0Ms/0z3 O0Ms5/0z4 OMs/Ozs O0Ms5/0z6 
OM6/0z1 OM6/dz2 O.Mg/Oza OM6/0z1 OM6/0z5 OMe/Oz6 


has to satisfy the condition — . 
MT JM = J, (5.6) 


where J is the totally antisymmetric matrix 


0. 1 0 0 0 0 

-1 0 0 0 0 0 

re, 0 0 0 1 0 0 
0 0-1 0 0 0 

0 0 0 0 0 1 

0 0 0 0-1 0 


The proof of this so-called condition of symplecticity certainly goes beyond 
this volume and can be found, for example, in [5]. But we can readily see 
that the symplectic condition, which mixes in a very defined way the terms 
OM, /0z; that are themselves power series, entails a large variety of nonlinear 
restrictions between the aberrations. 

From eq. (5.6), we have 


—JMTjM = Î > -MJMT JM = M 5 —MJMT j — E, 
hence Pop . 
MJMT = J. 
Furthermore, the inverse of a symplectic matrix always exists and can be 
obtained easily with the help of eq. (5.6), as 
JMTJjM =-Î => (- 7j) M-ÍiM---JMTJ. 
M-!--JMTJ. (5.7) 
And from the following simple arithmetic 
5 P urs mane BNE Ze A A PES n 
(ur) IM = (375) 3 (GMT) = SASSA S 


- JMJMTj — 443 = J, 
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it is shown that I~! is symplectic as well: 
A T eu c k 
(ur) Jar = J. 


Now let us obtain the determinant of M following Kauderer (page 10 in 
[36]). Through a series of permutations, we can rewrite J, which becomes 


where Î is the n x n identity matrix. Furthermore, M can be written as 


«X 1 
«-($5). 


where A, B ; C and D are n x n matrices. The symplectic condition becomes 


(ar be) (1 0) (@ 3) = (4 9) 
BT pT} \-1 0/NC DJ. \-T O/’ 
which leads to the relations 
-ÔT Â + ATC = 6, -ĈT B + AT b — f, 
=p ALB OST, DT B + BT b = Ô. (5.8) 


Furthermore we need one more mathematical theorem, which is 
det A0. det A- det D = det(AD) 
ĉ D) i i 


Hence we have 


Ew. ô m 


Furthermore, using the relations det A = det AT, CÁ-! = (AT)—!CT and 
AT D — CT B = I that have resulted from eq. (5.8), we obtain 


det & : ) = det AP det (D — (47-167 B) = ae (47b - 678) = 1. 
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One direct consequence of the symplectic condition and the resulting unity of 
the Jacobian determinant is that the volume of the phase space is conserved 
under Hamiltonian motion, which is known as Liouville's theorem. 


'The detailed study of the relationships between matrix elements is cumber- 
some and can be found in [79]. Here we will restrict our attention to what 
happens in the linear case. Considering the constant part of the symplectic 
condition (5.6), we observe that what contributes via the Jacobian is just 
the transfer map. Let us assume no acceleration and coupling between the 
transverse planes, which entails that the transfer matrix is given by 


(x|r) (xa) 0 0 0 (z|ó) 
p " a o id) dip. 
^ 0 0 yy) (y) 0 (yd ^ DA 
M = m 0 Y D , 
0 0 (bly) (blb) O (b/d) Wu 
(x) (tla) (ly) (b) 1 (Uô) Ly Ly E 
0 0 0 0 0 1 
where 
xe Cs a) y= Ce a) A -( 1 ae 
(zla) (ala) J’ (bly) (blb) / ' 0 1 
a _f 9 (æļô) a .( 9 (o) 
5.-( 0 (a)? 5N o (lay) 
p _ (Hx) (la) ; _ ( (ly) (b) 
t.-( 0 0 f Ly = 0 0 
Plugging into the symplectic condition yields the equation 
T ô Wr J 660 X ô D, J 0 ô 
ó Par | [5349] $255, | es [9:0 (5.9) 
br br ET) NO ô JJ VE, 1, Ê ô ô J 
The left hand side is 
ehane, R ibri e 
e. Praia `P Irie 
DTjE TIe DIJY 4 ETJL, DTID,+ DTD, + BT IB 


After straightforward algebraic manipulation, we have for each term in the 
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left hand side 
RTIX+LTIL, = ^ ele ala) — (ila) (ale) |. 
— [(2]@)(ala) — (aa) (ala) 0 


OT ay PT RR 0 (uly) (blb) — (bly) (ylb) 
arc Vee ( - [(yly) (618) — (bly) (GID) 0 ) 


AT jf AT jf AT 3f 0 1 
DIDID IDE j-(3 ue 


So, the conditions in eq. (5.9) are described as 
(x|x)(ala) — (z|a)(alx) = 1, 
(yly)(blb) — (bly) (ylb) = 1, 
(Ix) + (x|x) (ad) — (a|x)(x|0) = 0, 
(Ia) + (z|a)(a|ó) — (a|a)(x|à) = 0, 
(ly) + (uly) Els) — (bly)(yló) = 0, 
(1b) + (y]b)(b]8) — (b|b)(y|d) = 0. (5.10) 


The first two of these are familiar and describe the fact that the volume of 
phase space is preserved under the linear transformations generated by particle 
optical elements. The other conditions, however, represent the connection 
between longitudinal and dispersive effects. For them to be satisfied requires 
the use of specific scaling factors for the variables | and 6, and they are the 
reason for the specific choice of the variable & in eq. (2.1) in Section 2.1. 
Next let us study the case that coupling between horizontal and vertical 
planes is present, where the Jacobian matrix M isa4x4 symplectic matrix. 
Similarly, we can divide M into four blocks of 2 x 2 matrices, which is 


«(2 


Plugging into eq. (5.6), we obtain 
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which can be simplified as 
detA+detC =1, detB+detD=1, ATIB+CTID =0. 


In general A, B, C and D are not symplectic matrices. Meanwhile, from eq. 
(5.7), we have 


gas ($9 (4 OP) 0 (dárs seni 
=o a) ar ar) Vo a) ^ sera sors 
AC 
5 $) (5.11) 
where A is defined as N P 
A=-JAT J, 


and B, C and D are defined the same way. In addition, we have 


- 0 1 a11 21 01V. az —àiY _ "E 
A=- E 3 Ge =) e 3 7s E 2 — (det A) A. 


Since M`! is symplectic, we have 


det Â + det Ê = 1, det Ĉ + det Ô = 1, AC JOE BTÎD = Î, 


ATIA+BTIB ATIC+ 
CTJA+DTIB CTIC+ 


II 
ATTN 
> & 
e c 


and hence 


obtain a set of important relations 
det Â+ det Ê — 1, det Ô= det Â, detC = det B. 


It is worth noting that there are only six independent constraints from the 
symplectic condition for a 4 x 4 matrix. 


5.2 Differential Algebras 


In this section we will provide an introduction to the theory of Differential 
Algebras (DA) which enables the computation of transfer maps to an arbitrary 
order. For reasons of brevity we only provide a limited overview [4]; a more 
complete treatment can be found for example in [5]. For the sake of clarity, 
we first address the simplest case of Differential Algebras, mathematically 
denoted as the structure ;D}. 
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5.2.1 The Structure ‚Dı 


Consider the vector space R? of ordered pairs (ag, a1), ao,a1 € R in which 
an addition and a scalar multiplication are defined in the usual way: 


(ao, a1) + (bo, b1) = (ao + bo, a1 + b1), (5.12) 
t- (ao, a1) = (t . ao, t $ a1), 


for ao, a1, 60,6; € R. Besides the above addition and scalar multiplication, a 
multiplication between vectors is introduced in the following way: 


(ao, a1) * (bo, b1) = (ao - bo, ao - bı + a4 « bo), (5.13) 


for ao, 41, 69, b; € R. With this definition of a vector multiplication the set of 
ordered pairs becomes an algebra, denoted by 1 Dı. 

In the same way as in the case of complex numbers, one can identify (ao, 0) 
as the real number ao. Where in the complex numbers, (0,1) was a root of 
—1, here it has another interesting property: 


(0, 1) ` (0, 1) 2 (0, 0), 


which follows directly from eq. (5.13). So (0,1) is a root of 0. Such a property 
suggests thinking of d — (0, 1) as something infinitely small, small enough that 
its square vanishes. Because of this we call d — (0,1) the differential unit. 
The first component of the pair (ao, a1) is called the real part, and the second 
component is called the differential part. Using this notation it becomes clear 
that elements of ı Dı can be written as ao + a4: d, and multiplication amounts 
to multiplying the polynomials (ao + a1: d) and (bo + bı - d) and keeping only 
terms linear in d. 

It is easy to verify that (1,0) is a neutral element of multiplication, because 
according to eq. (5.13) 


(1,0) : (ao, a1) = (ao, a1) $ (1,0) = (ao, a1). 


It turns out that (aọ,a1) has a multiplicative inverse if and only if ao is 
nonzero. In case ao Æ 0 the inverse is 


1 a 
(ao, a1)~* = (—.-5) (5.14) 
0 


Using equations it is easy to check that in fact (ao, a1)! - (ao, a1) = (1,0). 
An outstanding result of the methods of differential algebras is that dif- 
ferentiation becomes an algebraic problem, and the differential part of the 
difference 
f(x +d) — f(a) 
equals the conventional derivative. Thus, given any differentiable function f, 
we can compute its derivatives by just evaluating the formula and thus obtain 


f(x) = D[f(z +d) — f(z)) =D [f(z + d)], (5.15) 
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where D denotes the differential part. In the last step use has been made of 
the fact that f(x) has no differential part. Hence Differential Algebras are 
useful to compute derivatives directly, without requiring an analytic formula 
for the derivative and without the inaccuracies of numerical techniques. 

The computation of derivatives shall be illustrated in an example using the 


following function: 
1 


fie m Ur (5.16) 
The derivative of the function is 
1/z22—1 
fo) = =, 
(x + 1/2) 


Suppose we are interested in the value of the function and its derivative at 
x = 2. We obtain 


Now take the definition of the function f in eq. (5.16) and evaluate it at 
2+d = (2,1). One obtains: 
1 1 1 


ND- & 33 a3 - G3) 0/21/85 ~ 6/23/85 
2 2 3 
= (u 6/2). - (8/4) / (5/2?) = (2-3) 


As we can see, after the evaluation of the function the real part of the result 
is just the value of the function at x = 2, whereas the differential part is the 
derivative of the function at x = 2. 

By our choice of the starting vector (2, 1), initially the vector contains the 
value 7(2) of the identity function I : x — x in the first component and the 
derivative of /'(2) = 1 in the second component. 

Now assume that in an intermediate step two vectors of value and deriva- 
tive (g(2), g/(2)) and (h(2), h’(2)) have to be added. According to (5.12) one 
obtains (g(2) + h(2), g'(2) 4- R’ (2)). But according to the rule for the differenti- 
ation of sums, this is just the value and derivative of the sum function (g + h) 
at x = 2. 

The same holds for the multiplication: Suppose that two vectors of value 
and derivatives (g(2), g'(2)) and (A(2), /(2)) have to be multiplied. Then 
according to (5.13) one obtains (g(2) - h(2), g(2) - h’(2) + g'(2) - h(2)). But 
according to the product rule, this is just the value and derivative of the 
product function (g- h) at z = 2. 

The evaluation of the function f at (2, 1) can now be viewed as successively 
combining two intermediate functions g and h, starting with the identity func- 
tion and finally arriving at f. At each intermediate step the derivative of the 
intermediate function is automatically obtained as the differential part ac- 
cording to the above reasoning. 
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An interesting side aspect is that with the search for a multiplicative inverse 
in eq. (5.14) one has derived a rule to differentiate the function f(x) = 1/x 
without explicitly using calculus rules. 

After discussing the algebra ıDı and its virtues for the computation of 
derivatives, we now address the most general Differential Algebra, the struc- 
ture ,D,, which corresponds to the case of power series of v variables to 
the nth order. It will eventually allow us to arithmetically compute partial 
derivatives of functions of v variables through order n. 


5.2.2 The Structure , D, 


We define N(n,v) to be the number of monomials in v variables through 
order n. We will show that 


= = (n vo), 


where C(t, j) is the familiar binomial coefficient. First note that the number 
of monomials with exact order n equals N (n, v — 1). This is true because each 
monomial of exact order n can be written as a monomial with one variable 
less times the last variable to such a power that the total power equals m. 
Thus we have 

N(n,v) = N(n—1,v) + N(n,v — 1): 


the number of monomials in v variables through order n equals the number 
of monomials of one order less plus the ones of exact order n. This recursive 
relation is satisfied by C(n + v, v). Since also obviously C(1 + 1,1) = 2 = 
N(1, 1), the formula follows by induction. 

Now assume that all these N monomials are arranged in a certain manner 
order-by-order. For each monomial M we call Im the position of M according 
to the ordering. Conversely, with M; we denote the Ith monomial of the 
ordering. Finally, for an I with M; = z4 --- at» we define Fy = ii! iul. 

We now define an addition, a scalar multiplication and a vector multiplica- 
tion on R in the following way: 


(a1,..., aN) + (b1,...,0N) = a, d b3,::: ,an + by), 
t- (a1,...,aN) = (t- a1,- ,t- ay), 
(a1,---,an)-(b1,---,bn) = Clsss i4 CN), (5.17) 


where the coefficients c; are defined as follows: 


vb 
psk 3 o (5.18) 


To help clarify these definitions, let us look at the case of two variables 
and second order. In this case, we have n = 2 and v = 2. There are N = 
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C(2 + 2,2) = 6 monomials in two variables, namely 
1, x, y, TE, vy, yy. (5.19) 


As an example, using the ordering in (5.19), we have Izy = 5 and M3 = y. 
Using the ordering in (5.19), we obtain for cı through cg in eq. (5.18): 


cy = a1 - by, 

C2 = Qı + b2 +a2°- b1, 

c3 = a4: b3 + a3 ° b1, 

c4 = 2+ (a1: b4/2 + a9 - b2 + a4: 04/2), 
C5 = 01:05 + a2: 03 + ag + b2 + as bi, 
ce = 2- (a1 - bg /2 + a3 - b3 + ag - 03/2). 


On nD, we introduce a third operation ð; : 


O0, (a1, ..., aN) = (&1,..., CN) 
with 
0 if Mj has order n, 
Ci = 
* T Ed else. 


So ô, moves the derivatives around in the vector. Suppose a vector con- 
tains the derivatives of the function f, then applying 0, to it one obtains the 
derivatives of Of /Ox, through one order less. With this third operation, ;, D, 
becomes a so-called Differential Algebra (DA)[5]. 

While in 1D4, d = (0,1) was an infinitely small quantity, here we have a 
whole variety of infinitely small quantities that have the property that high 
enough powers of them vanish. We give special names to the ones in com- 
ponents J belonging to first order monomials, denoting them by d Mr. In the 
example of 2D2, we have dx = (0,1,0,0,0,0) and dy = (0,0, 1,0,0,0). It then 
follows that instead of eq. (5.15) we obtain 


af Of Of Pf Of 
d = Se E A E bie cd . 
f(x ^- dz, y + dy) (^ Da’ By’ OL?’ Drdy Dy? (x,y) 


In the general case of v variables and order n, after evaluating f in DA one 
obtains: 
Qctisacto tte f 


ETUR de oe 
Oxy Oxy +++ Oxy 


. -- 
(zi ci) 


where Ifi is the index of the monomial zł ---a!, as defined in the 
peewee 


v? 


beginning of the section. 
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5.2.3 Functions on Differential Algebras 


In this subsection we will generalize standard functions like exponentials, 
logarithmic and trigonometric functions to Differential Algebras. As we will 
see below, virtually all functions existing on a computer can be generalized in 
a straightforward way. 

We start our discussion by noting that for any Differential Algebra (DA) 
vector of the form (0,a4,..., aw) € nDv, i.e., with a zero in the component 
belonging to the zeroth order monomial, we have the following property: 


(0,a1,...,an)’ = (0,0,...,0) fori>n, (5.20) 


which follows directly from the definition of the multiplication in nDe, defined 
in eq. (5.17). 

Let us begin our discussion of special functions with the exponential func- 
tion exp(z). Assume we have to compute the exponential of a DA vector 
that has already been created by previous operations. First we note that the 
functional equation 

exp(z + y) = exp(x) - exp(y) 
also holds for DA vectors. As we will see, this facilitates the computation of 
the exponential considerably. 


exp|[(ao, @1, @2,.--,@n)] = exp(ao) - exp[(0, a1, a», . .., an)] 
oo ; n ; 
B (0,21,02,...,aN)' _ (0, 21,02, ...,aN)' 
zi exp(ao) ^ > e 0 Cx exp(ao) x 3 | 03 
= qz 


In the last step use has been made of eq. (5.20) which entails that the 
sum has to be taken only through order n and thus allows the computation 
of the root in finitely many steps. Hence the evaluation of the real number 
exponential exp(ao), which internally on a computer requires a power series 
summation and hence cannot be done accurately, is more subtle than the rest 
of the operations in Differential Algebra. 

A logarithm of a DA vector exists if and only if ag > 0. In this case one 
obtains 


log [(ao, a1, a2, .. . , aN)] = iog fao: IE (02,2... p 


ag ao ao 

ot m ay, Q2 QN : 
Ei Oso Y Cy Pie ee e e 
(log(ao), E , ) 2 ) (02,2, Ax) 
us 141 ay Q2 an t 

= (1 0,...,0)+5 Ibm eae 
(log(ao), E , ) 2 ) (0.22, L3 


Again use has been made of the fundamental property of the logarithm 


log(z - y) = log(z) + log(y) 
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which also holds for DA numbers and leads to simplifications by virtue of eq. 
(5.20). 

As the last example, we will derive a formula for the root function. Even 
though there is a direct method to compute roots by solving a set of linear 
equations for the coefficients of the root, we present here a technique based on 
power series following an approach similar to the exponential and logarithm. 
The root has the following power series expansion: 


vi+r= yoy gË. 
i—0 


Using this formula and the definitions of addition and multiplication in 
(5.17), one directly obtains for the square root of a DA vector: 


ao ao’ ao 


= 1-3---(2i—3) di a2 an 
= : —y : MID 
vao 24 2-4--- (22) (v. ay’ ao ao ) 


Using the addition theorems for sine and cosine, one obtains formulas with 
finite sums in a quite similar way; in general, suppose a function f has an 
addition theorem of the form 


fla+ b) = ga(b); 


and ga(b) can be written in a power series, then by the same reasoning its 
Differential Algebraic extension is computable exactly in only finitely many 
steps. 

In the meantime, there are numerous codes built on the ideas of Differential 
Algebraic methods, including the code COSY INFINITY [7, 50]. 


5.3 The Computation of Transfer Maps 
5.3.1 An Illustrative Example 


Differential Algebras (DA) can be used very efficiently to compute the trans- 
fer map of eq. (2.3) of particle optical systems in its Taylor series representa- 
tion. 

To illustrate this, let us start the discussion with a very simple example, 
the midplane motion in a 90° homogeneous bending magnet. Let x; and 
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FIGURE 5.2: Motion in a 90° homogeneous dipole magnet. The dashed 
arc is the reference orbit. 


à; = sin(o;) denote the initial distance and scaled transverse momentum 
relative to the reference trajectory (see Fig. 5.2). Then we are interested 
in the values x» and a; = sin(o;). Since the trajectories in the magnet are 
circles, we can readily read from Fig. 5.2: 


A = Rsin(a;) = Rai, 


B= Rü - eoo) ea = R(1- yi?) + ti, 


af = sin(aş) = RR 


ny =A- R(1—cos(as)) = A- R(1- A - ad). 


These equations allow the computation of the final coordinates v; and a; 
in terms of the initial coordinates x; and a;. However, taking these equations 
and performing all operations in Differential Algebra allows us to even obtain 
all derivatives of xf and ay with respect to x; and aj. These so obtained 
derivatives, evaluated at x; = 0, a; = 0, are then the expansion coefficients 
of the map in eq. (2.3). For the sake of clarity, let us explicitly show how zy 
and af are computed. 


Using the ordering in (5.19) and identifying the variable a with y, we obtain 
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using the arithmetic defined in eqs. (5.17), 


a; = (0,1,0,0,0,0), a; = (0,0,1,0,0,0), 
A = (0,0, R,0,0,0), B = (0,1,0,0,0, R), 
1 1 
a (o. 5.0.0.0, li xy (0.0.2 E) (5.21) 


Comparing the obtained result with any matrix code, we find complete 
agreement; as an example, the fact that the second component of x; is zero 
implies that Ox ¢/Ox; = 0 and hence (r|x) = 0, which is a well known property 
of 90? bends. 

In case an additional particle optical element is to follow this bending 
magnet, one does not have to start all over evaluating this new element at 
x; = (0,1,0,0,0,0), a; = (0,0, 1,0,0,0), but one can start already with x; 
and ay of eq. (5.21). This way one can save the usually quite involved con- 
catenation process and increase performance significantly. 


5.3.2 Generation of Maps Using Numerical Integration 


In this subsection we will address the general case in which no closed solu- 
tion of the problem exists. We will see that also in this case we are actually 
able to compute transfer maps of arbitrary order for arbitrary particle optical 
elements. Even though we do not have analytical formulas that relate the final 
coordinates to the initial coordinates, there is still a way to computationally 
relate the final coordinates to the initial coordinates, by numerical integration 
of the equations of motion. 

In this case, the final coordinates are still computed from the initial coor- 
dinates using standard arithmetic and functions; however, the relations are 
more complex than in the case of the homogeneous sector. As in any conven- 
tional numerical method, a numerical integrator is used to solve for the final 
coordinates. 

Now blindfoldedly performing all these operations in Differential Algebra 
automatically leads to all desired derivatives of the transfer function, regard- 
less of the form of the equations of motion. In other words, all coordinates 
and fields at any step are power series instead of real numbers. 

Differential Algebraic (DA) techniques have been implemented in many 
programs. They allow the computation of transfer maps of elements with a 
dependence on the independent variable for which an analytic solution cannot 
be obtained. One example is magnets with fringing fields. Another example 
is an electrostatic round lens where the electric field varies with s throughout 
the entire lens. As long as the electromagnetic field can be expressed in a 
differentiable function, the transfer map up to any given order can be obtained 
using the DA technique. 
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5.4 Manipulation of Maps 


In most cases, a beam optical system consists of more than one element 
and it is necessary to connect the maps of individual pieces. Often the inverse 
of a map is needed. Sometimes one part of our system is the reversion of 
another part; therefore, it is time saving if the map of the reversed part can 
be obtained directly from that of the other part. All these map manipulations 
can be done elegantly using DA techniques. 


5.4.1 Composition of Maps 


Whenever a system contains more than one element, which is virtually 
always true, we have to deal with the composition of maps. This is the 
foundation of almost all other map manipulations, as we will see later. 

Let us define o as the symbol for composition. Hence the map M of a 
system consisting of two parts is 


M — Ma o Ml, (5.22) 


where Mı and Mg are the maps of parts 1 and 2, respectively. According to 
Taylor's theorem, My, can be expressed as the sum of a Taylor series and a 
remainder: 


M — Ta Ra, 


where Rn is of order n 4- 1. With the assumption that Mı is origin preserving, 
we have 


[M] = [M2 o Mijn = (Th + Rn) o Mailn = [Tn o Miıln + [Rn o Miaıln 
= [m e Mijn = T (Mila) 


Thus [M]; can be obtained by composing two polynomials, which is called 
concatenation. 

Concatenation is the most frequently used tool in DA calculations. It is 
extremely efficient when a given optical element appears multiple times in 
a system. Instead of computing the map every time it appears, the map of 
the element can be applied to the system using concatenation after the first 
appearance. 

When the exact formula of Mz is known and it is relatively simple, eq. 
(5.22) can be used directly to compute M, spending only a small fraction 
of the time required for concatenation. The saving comes from the fact that 
Mə is not expanded into a Taylor series. In fact, this method has been used 
whenever a element is a drift or a homogeneous dipole because their maps can 
be obtained from simple geometry. 

Another application of the DA concatenator is the transformation of coor- 
dinates among different codes for the study of the dynamics of beams. For 
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instance, while many codes work in the above discussed curvilinear canonical 
coordinates, other codes use the slope instead of the normalized momentum. 
In certain cases, Cartesian coordinates are used, which may be a better choice 
for certain elements, for example, wigglers, but are usually not very well suited 
for a discussion of the properties of beamlines. The transformation of a trans- 
fer map to a different set of coordinates, which can be expressed as 


Mr =C71o Me 9C, 


is quite straightforward using Differential Algebra. One simply composes 
the transformation formulas between the two sets of coordinates, the map in 
one set of coordinates and the inverse transformation in Differential Algebra. 
Thus, one automatically obtains the map of the other set of coordinates to 
arbitrary orders. 


5.4.2 Inversion of Maps 


In this subsection, another kind of manipulation, the inversion, will be 
studied. The first step of inverting a Taylor map is to invert the linear part. 
If it exists, it can be done with any standard linear algebraic package. For 
the nonlinear part the inversion is done in an order-by-order fashion. To the 
second order we can write the map and its inverse as 


M = Mı +M, Mt = Mi 4 Mg. 


Note that the subscript denotes the order of the map. Concatenating those 
two, we obtain 


MoM" = (Mı+Mə2)o (M -. M1) =2 T + Mao. M] M30 M3! = T, 


where the term Mz o M ! is dropped due to the fact that it contains terms 
of the fourth order. As a result, the second part of the inverse is 


M3! 23 Mio M20 Mj". 


Now let us assume that we already inverted the map up to the (n — 1)th order; 
we write the map and its inverse up to the nth order as 


M =n Mi +Mn, MI M1 TM. 
Hence, we have 


M o MC! =n (Mı +Mn) o (MI M5) 
= p T + Mpo MI! + Mio M3! + Mno M7! =, T. 


Since M,, is of the second order and higher, only those terms of the order 
[n/2] or lower in M}! contribute. The result is 


M7! =n —Mi o Mn o (Mil My), 
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where m — [n/2] and m » 1. It is clear that the inverse of a Taylor map 
exists as long as the inverse of its linear part exists. In later chapters, we will 
encounter a number of problems that have to be solved through finding the 
inverse of a map. 


5.4.3 Reversion of Maps 


Throughout the development of beam optical systems, mirror symmetry has 
been frequently used. Recently, mirror symmetric systems are being studied 
in great detail in the search for high order achromats. Among the various 
symmetry arrangements, reversion is the most commonly used. For a system 
which is the reversion of another one, the transfer map is the map obtained by 
going through the system in the reverse direction. The reversed motion can 
be described by first reversing time, i.e., switching all the signs of p; and py, 
then going through the inverse map, and finally re-reversing time. The time 
reversal operation can be performed easily using Differential Algebra, and the 
inversion of the transfer map is done as described in Section 5.4.2. 

Specifically, reversion entails that, if a particle enters and exits the forward 
system at an initial point (£4, aj, yi, bi, li, Ôi) and a final point (xy, ay, yy. b, lg, 
df), respectively, it will exit the reversed system at (£i, —ai, yi, —bi, —li, ĝi) 
after entering this system at (ry, —af, Yf, —bf, —lf, 0g). This determines the 
reversion transformation: 


1.000 0 0 
0—1 0 0 0 0 

" 0 0 1 0 0 0 

R=| ò o 0o21 0 0l 
0 0 0 0 -1 0 
Ü: 0 0 0 0 1 


and hence the map of a reversed system: 
MEF = (R) o M^ o (R71). 
In DA representation, MẸ is computed through concatenation, where 
ME = (R) o M; o (R71). 
In fact, the second composition (Ê) o Mz! 
the signs of the rows for a, b and l. 


An interesting point worth noting is that R is not symplectic. In fact, it 
satisfies the following relation 


can be done by simply changing 


RTJR = —J, 


which we call an anti-symplectic relation. 


Chapter 6 


Linear Phase Space Motion 


In this chapter, we want to study the action of transfer matrices on particles 
by looking in detail to what happens to entire regions of phase space as 
they are transported. This is important because the beam in an accelerator 
is just such a region, and of course we want to make sure that at any time, 
this region is within the beam pipe. 

Let us begin by collecting several observations about two-dimensional trans- 
fer maps M. 


1. M preserves areas. 
2. Different initial points have different final points. 
Continuous curves stay continuous curves. 


Closed curves stay closed curves. 


LEE CEN A 


A point inside a closed curve will stay inside of the closed curve. 


Fig. 6.1 illustrates the second item, and Fig. 6.2 illustrates the other items. 
The first item led us to giving a name, namely “emittance,” to the preserved 
area. The last two items are particularly important, as they tell us that if we 
can enclose our beam within any closed boundary curve, then it is sufficient to 
study the dynamics of this boundary curve alone. It is interesting to note that 
while in the two-dimensional case, closed curves always stay closed curves, it 


M 


Ai, a2, 


Tı T2 


FIGURE 6.1: Mapping of individual points in phase space. 
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ay a2 


Tı T2 


FIGURE 6.2: Mapping of a closed curve in phase space. 


is not generally true that in higher dimensions, closed surfaces stay closed 
surfaces. While this is true for linear higher-dimensional transformation, non- 
linear maps can produce some holes in the surfaces through which particles 
that were initially trapped inside the surface may find a way to escape, which 
is a very important mechanism that can lead to instability. 

If in particular M is linear, then we also have the following observations. 


1. Straight lines stay straight lines. 
2. Ellipses stay ellipses. 


Since straight lines stay straight lines, we may manufacture such a boundary 
curve as a polygon, and to study its motion it is completely sufficient to move 
only the corner points. Alternatively, we may try to enclose the beam by an 
ellipse. Before we follow these ideas, let us first study the action in phase 
space of some simple devices. 


6.1 Phase Space Action 
6.1.1 Drifts and Lenses 


As seen in Section 2.2, the transfer matrix of a drift is given by 


w(i) 


This matrix leaves a constant and moves x by an amount proportional to a; 
hence it performs a horizontal shearing in phase space as shown in Fig. 
6.3. 
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a4 
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aT 


FIGURE 6.3: Action of a drift in phase space. 


A lens has the transfer matrix 


t= (Cart). 


and it leaves x invariant and changes a by a value proportional to x; hence it 
performs a vertical shearing as shown in Fig. 6.4. 


6.1.2 Quadrupoles and Dipoles 


In the case of quadrupoles and dipoles as seen in Sections 4.2 and 4.3, the 
matrices have the following form: 


^ cos @ ksing 
s Nm cos ó i 


This corresponds roughly to a rotation, except that the x and a coordinates 
are also stretched or compressed; the result is a motion on an ellipse as shown 
in Fig. 6.5. In fact, computing the invariant ellipse of the motion following 
the procedure described in Section 8.1.2, we obtain 


Applying to eq. (6.1), we see from a; = 0 that the ellipse is even upright. 
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FIGURE 6.4: Action of a thin lens in phase space. 


FIGURE 6.5: Action of a quadrupole or dipole in phase space. 


6.2 Polygon—like Phase Space 


In order to study the motion of ensembles of particles under linear transfor- 
mations, it is useful to characterize them by certain simple geometric forms 
in which the particles are contained and requiring only few parameters. The 
two most useful such forms are the polygon and the ellipse. 

A polygon in phase space is uniquely defined by its corner points; and since 
straight lines stay straight lines, it is sufficient to study just the motion of 
the corner points. Fig. 6.6 shows the motion of a polygon under a linear 
transformation. 

Frequently a polygon with just four points is chosen; if its lines are initially 
symmetrically arranged around the origin, they will stay symmetrically ar- 
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FIGURE 6.6: Mapping of a polygon in phase space. 


ranged. Furthermore, a four-point polygon with symmetry around the origin 
is a parallelogram, and so parallelograms always stay parallelograms. 

In many cases it is worthwhile to study how the actual beam width changes 
as a function of the s-position along the beamline. The beam width is appar- 
ently determined by the maximum of the horizontal positions of the corner 
points. In the special case in which we consider motion through a drift, each 
of the corner points moves on a straight line. Furthermore, the corner point 
that is furthest out will stay furthest out until it is possibly overtaken by 
another corner point; during the time it determines the beam width, it entails 
that the beam width changes linearly with s. Since the outermost corner point 
can change from time to time, the resulting beam width is piecewise linear 
as a function of s. 


6.3 Elliptic Phase Space 


The other choice that is worth considering is that of an elliptic phase space 
as shown in Fig. 6.7. In this case, the boundary of the phase space satisfies 
the ellipse condition 


yz? + 2ara + Ba? =e, (6.1) 


which can be written in matrix form using a symmetric matrix as 


wa (7 H) er 


For future simplicity, we denote the matrix describing the ellipse by 6. 
We first note that there is a redundancy in the description of the ellipse: 
obviously, doubling the values of a, 8, y as well as € simultaneously leads to 
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FIGURE 6.7: An ellipse in phase space. 


the same ellipse. In order to eliminate this redundancy, we demand that the 
determinant of the ellipse be unity, i.e., 


f — o? — 1. 


With this choice of the matrix, the quantity £ is a unique measure of its area, 
called the emittance. The four quantities o, 8, y and € are called the Twiss 
parameters of the matrix. 

Now we are ready to study the question how the phase space ellipse changes 
as we pass through a system. Let M be the transfer matrix of the system; 
then the coordinates 71, a, are transformed to £2, a» via 


p 
es 


The new ellipse after the system characterized by M must obviously satisfy 


(x2, a2) + 3 - a =e, (6.2) 


and we also have 


a2 


Observe that if we demand det(G2) = 1, even the measure for the occupied 
area € must be the same as before since we know the transfer map preserves 
area. We recall that the old coordinates satisfy 


. x 
(@1,@1)-G1- @ =€ 
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FIGURE 6.8: Characteristic points of an ellipse in phase space. 


Expressing xı, a; in terms of x2, a2, which is accomplished by the inverse 
matrix, we obtain 


(x2, a2) - (COM i ir) b =e. (6.3) 


It is not difficult to show that (M-1)7 -6,- M- is a symmetric matrix, thus 
we conclude that the resulting object is again an ellipse. So ellipses are indeed 
preserved under linear transformation. Furthermore, since the determinant of 
the matrix is unity (see Section 5.1.4), such a representation of an ellipse by a 
symmetric matrix of unity determinant is unique, and because eqs. (6.2) and 
(6.3) hold at the same time, we must conclude that 


oy = (Gr) -ôi MI. (6.4) 


'This equation describes the transformation of the ellipse in phase space 
under the linear transformation. 


6.3.1 The Practical Meaning of o, 8 and y 


As we propagate the beam through a system, the value of 6 changes with 
s, and so do its three characteristic quantities a, 8 and y. It is important to 
study how the three quantities a, 8 and y describe important characteristics 
of the beam. Another important question relates to the shape and degree of 
deformation of the ellipse. Together with the widths, this is characterized by 
the points at which the ellipse intersects the axes as shown in Fig. 6.8. 
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FIGURE 6.9: Sketch of horizontal (solid) and vertical (dashed) 8 functions 
of a beamline. 


The question of axes intersection can be answered readily. In 


yz? + 2axa+ Ba? — e, 


we just set a and x to zero, and obtain 


z Vs a - [5 
0 — y! OSS B 


Now we address the calculation of the maximal points £m and am, which 
characterize the width as well as the maximum angle in the ellipse. To this 
end, we view the elliptic shape as the contour line of a function, and remem- 
ber that the gradient is always perpendicular to the contour lines. Hence 
the maximum position occurs where the angular component of the gradient 


vanishes, and the maximum angle occurs where the positional component of 
the gradient disappears. For the function 


f (x,a) = ya? + 2axa + Ba?, 
we have 
V f = (2*2 + 2aa, 2ax + 20a), 


and we infer that for the maximum position, we must have ax = — pa, namely 
a = —a/f - x. Inserting this into the ellipse yields 


2 
ya? 4 as ( Sa) -e( Sa) =£ => (By — o?) 2? = eg, 
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thus 
EEN 


Because of the symmetry of the equations with respect to interchange of x 
and a, we see that also 
am = VEN. 


So the maximal width in «x direction is determined by the area of phase 
space £ as well as the function f. Thus, 8 plays an eminent role, as it im- 
mediately tells the width of a beam at a given point; and plots of its value 
for different positions around the accelerator are very commonly studied. An 
example of such a plot showing the 8 functions for horizontal (x) and vertical 
(y) motion of a beamline at the Advanced Light Source (ALS) at Lawrence 
Berkeley National Laboratory (LBNL, LBL), California, USA, is shown in 
Fig. 6.9 [54, 21]. 


6.3.2 The Algebraic Relations among the Twiss Parameters 


In this section, we attempt to introduce concept of the phase advance, 
which is the difference in phase between two points on the s-axis, and obtain 
the relations among the Twiss parameters. Let M(s) be the transfer matrix 
of a beamline, which may or may not be part of a periodic transport system, 
from sı = 0 to s2 = s. Let a, 8 and y be the Twiss parameters at s; = 0 and 
a(s), G(s) and q(s) be the ones at s2 = s. From eq. (6.4), we obtain that 


(9 29) = (60) (12) 9. 
or in another form 
CORE EI OR (25). —— e» 


On the other hand, it is straightforward to show that 


VB -a/vB\ (ya\( VB o Y, 
0 1//B j Na BJ AcafvB VB) 


which means that the matrix 


e Oe Se 
—a/VB 1/VB 


transforms the ellipse into a circle. The new coordinates are sometimes called 
the normal coordinates. This equation entails that 


y oy (1/8 a/vB\ (1/vB 0 em 
o B o VB J \a/vB vB} | 
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Plugging eq. (6.6) into eq. (6.5), we obtain 


(ie) (709 cal MvB(s 0 Jsre 


0 8) } \als)/ VBR) VBS 
_ (14B IB (vB 0 
0 vB) \a/vB vB) 


Hence, we have the following relation 


p (sro x pn 


0 1/VB 0 B(s) 
( 1//B(s) 0 )se( VB o0 J-: 
o(s)/ VBC) V/B(s) -a[VB VB) 
Defining 
5). HESS © lg ( ve 0 
CHI 25) TR B ae om 


we immediately have 


or equivalently 


which entails that 


the latter holds due to the fact that det(R(s)) = 1. As a result, we have 


(at) at) = AS) Rl): 


which leads to the relations 


Ris) = R22(s), Ri2(s) = — Rə; (s). 
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Plugging them into the equation 
Rii(s)Re2(s) — Rio(s)Rai(s) = 1, 


we obtain 
Ris) T Ri(s) =1. 


Therefore, the matrix elements can be expressed as trigonometric functions 
of a single variable ¢(s) and the matrix R(s) takes the form 


: cosó(s) sinó(s) 
Ars 5 MAE = d) à e) 


Since R(0) = I, ¢(0) = 0. Plugging eq. (6.8) into eq. (6.7), we obtain the 
explicit form of the transfer matrix, which is 


SET B(s) 0 bis l//B 0 
ANUS P i " me) ee p 
iF VHS 0 Ue) SLO a 
i (s/A/8(s) 1/A/B( —sinó(s) cosQ(s) a/VB JB 
6 


| ([NBSIB (cos ¢s+asin $5) VBB sin bs 
2 mai(s) v/8/8, (cos $s — as sin s) 


where 
1 


TOS TUE 


and as, Bs and $, represent a(s), 8(s) and $(s), respectively. Denoting 


(a, — a) cos ps 4- (12-005) sin ds| 


where each element is a function of s, it is easy to show that 


(v|a) 
B (ala) — a (ala) 


when the values of (x|z) and (x|a) from eq. (6.10) are plugged in. 

From the coordinate transformation point of view, eq. (6.9) illustrates the 
relation between the physical coordinates and the normal coordinates. The 
right most matrix transforms the physical coordinates into normal coordinates 
where the motion in the normalized phase space is a rotation, represented by 
the middle matrix. At end of the transport, the normalized coordinates are 
transformed back to the physical coordinates. 


tan s = 
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FIGURE 6.10: Transformation of an ellipse under a drift. 


For many practical applications, it is useful to explicitly study the trans- 
formation of the ellipse (6.4) through the influence of the matrix M. We first 


observe that if ( | í | 
Ma aa) 


ui - ( 9, 09) 
as simple arithmetic shows. So we have 
T 
^ (nri) .[(191)|,.f(wyy-1| .[ 72 02 
ôa = (M ) e 2 (s ) & 2 
_ ( (ala) 3027] e 3 ( (ala) po 
—(z|a)  (v|z) a, fi —(alz) (xx) 

Performing the calculations, we see first of all that a2, £5 and y2 depend 


linearly on o4, 61 and 7, and hence the relationship can be written in 
matrix form. Explicitly, we have 


then 


b2 (ala -2 (x|x) (zļa) (z|a)* By 

a2 | = | —(2|x) (alz) (ala) (ala) + (zla) (ala) — (zla) (ala) | | ex 

2 (al) —2 (alz) (ala) (a|a)* yı 
(6.11) 


One particularly interesting case is the one where we let an ellipse evolve 
under the action of a drift, as shown in Fig. 6.10. 

If we are interested in the way in which the width of the beam changes, we 
must look at the function (s). For the special case of the drift matrix with 
(z|z) = (ala) = 1, (a|z) = 0 and (x|a) = L, we have 


B(L) = (zlx)* Bi — 2 (x|e) (zla) on + (zla)? y1 = 61 — 2Lea + Z^ 


Qı : a? [21 E 1 
ayn (L-—) -4+f=n(L-—) +>. (6.12) 
yı yı yı 


Linear Phase Space Motion 153 


8 


B 


Q^ 


FIGURE 6.11: Plot of the 6 function in a drift. 


So as a function of L, 8(L) changes quadratically. We also see readily that at 
the point where 


Ic 


, 


yı 


the beam has minimum width, and we have what is called a waist (see Fig. 
6.11). On the other hand, we obtain from eq. (6.11) that 


(L) = fy 


which reflects the fact that the divergence of the beam is not changed in a 
drift. As a result, we have 


&(L) = EDI 


which entails that 
a(L) =0 


at the waist. Meanwhile the behavior of a(L), which is 


vanishes at the waist. Finally, eq. (6.12) can be re-formulated as 


s? 


B(s) = Oba (6.13) 


where (* is the value at the waist and s is the longitudinal distance from the 
waist. 
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6.3.3 The Differential Relations among the Twiss 
Parameters 


Now let us consider the case that s is small and to the first order 6(s) =1 
B+ Bs, als) —1 a+as and ¢(s) —1 $ s. Similarly, the transfer matrix 
becomes 


a 1+ (8/28 o9) s Bó's 
M(s) =; ; , (6.14) 
- (1/8) e + (1 +a?) ¢]s 1- (6'/28 +a¢') s 


where each element Mij, the i, j-th element, is computed as 


Mii(s) =1 B S [cos (o's) + asin (o's)| =,1+ (5 + s) 8, 


Mias) =1 (8 + 8's) Bsin (à 5) =1 Bos, 
Mai(s) 21— aia [a's cos (0's) + (1 +a (a + a's) ) sin (o's) 


=) -+ [a + (1 +a?) e S, 


=,1- (5 tas ) S. 


On the other hand, the transfer matrix to the first order of s can be solved 
directly from the equations of motion, which is 


M (s) =1 (i J ) (6.15) 


where k is the focusing strength at sı = 0. Equating the corresponding terms 
in eqs. (6.14) and (6.15) yields 


2 B O 2 u 
Bo —1, PE 0, — le + (1+a) g] =-k, 

which can be simplified to the familiar form 
, 1 
ó = XS 
B 

o — kB —, 

B ced —2a, 


4 — 2ka. 
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The behavior of ¢, a, 8 and y are determined by the above set of differential 
equations. 


6.4 *Edwards-Teng Parametrization 


Here we have shown an alternative way of propagating the Twiss parameters 
through a beamline or a ring. The advantage is that only matrix multiplication 
and inner products of vectors (rows of a matrix) are used. In the following 
we will show that this way of tracking the Twiss parameters can be readily 
extended to coupled x and y motions. 

The approach is based on the Edwards-Teng parametrization of a four- 
dimensional symplectic matrix. From the relation 


Nn = M RN. ( fco D-snoY/(ÀAÓ Ícosp —D-!sing 
| XN \-Dsing fcosy 0 Dsing Icos i 
we obtain 


M = Acos* y + D! BD sin? o, 
Ñ = B cos? o + DAD“ sin? Q, 
m -—-— (Då — BD) sin o cos o, 


mne (Ab = b^) sin Y cos y, (6.16) 


where A, B and D are symplectic. It is straightforward to obtain 
Â=M-— D-t tang, B = Ñ + Dû tan y. 


Subtracting the second equation from the first equation of (6.16) and taking 
its trace, we have 


tr(M — Ñ) = tr (A — B) cos? y + (DBD — DAD“) sin? e| 


2 (trA- tr B) cos? p + [tr (b^ Bb) ^ 


E 
VEN 
& 
D 
& 
E 
— 
2 
B 
N 
"6 


= (tr A-tr B) (cos? y — sin? p) 


= 2 (cos — cos u2) cos (24) . (6.17) 


In the last step, the assumption that |tr Â| < 2 and |tr B| < 2 is adopted 
following [25], where the matrix M4 is the one turn matrix of a ring. For a 
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generic symplectic matrix, all the conclusions hold after replacing cos pı — 
cos u2 with (tr A — tr B)/2. As a result, we have 


tr(M — Ñ) 


20) = =. 
ene) 2 (cos i4 — cos u2) 


From the fourth equation of (6.16), on the other hand, we have 
n= (â~ — De) sin Y cosy = (AJDT) — JD JB) sin Y cos q, 
and hence 
Jat jT = j (47 b3AT — ETITI) JT sin y cos y 

=— (bà! - 5b) sin q cos y. 

Adding the third equation of (6.16) on both sides, we obtain 
m+ Jats? 2— [ô (4 + A) — (5 + ~) b sin cos o 

=— [Ditx A) — (tr B)D| sing cos o 

= — (cos p — cos u2) sin (29) D. (6.18) 
Finally, we obtain 


m+ Jat jT 


pa ma 
(cos [41 — cos u2) sin (29) 


Now, the only unknown quantity is cos uı — cos u2, which can be obtained 
below. From eq. (6.18), we have 


det G + Ji? jT) = (cos ji — cos ji)? sin? (2p) . 


Adding the square of eq. (6.17), we obtain 


ANNE" det(rh + JAaT JT 
cos yı — cos p2 = 5 tr(M — N) 1 etr Aya 
tr(M — N)/2 


Now let us consider the case that M4 is symmetric, i.e., MT = M, ÑT =Ñ 
and £T = fn. As a result, we have 


m+ IMIT 


DS aa H o 
(cos [41 — cos u2) sin (25) ' 
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and so 

Jr? jT + (IMT JT) jT 

(cos [41 — cos u2) sin (2) 

aoe ee E a 
(cos [41 — cos u2) sin (2y) 


o-(24). 
Boer p 


The relation DT — D- l entails that d = a and c = —b. Combining with 


det Ô = 1, we obtain a? +b? = 1, which means that D is a rotation. Further- 
more, we have 


D-eJgprgt e 


Let us define 


Since det D — 1, we have 


jig (Im? JT. + MT) (det ri) I + Tm 
m ——————————————— a m 
(cos ju — cos u2) sin (2 "m ~ (cos u1 — cos u2) sin (2y)’ 
and 
pa - (+ Jind mt 0 mm? + (det m) 


(cos pı — cos u2) sin(2y) ^ —(cospa — cos u2) sin (2y) | 
Therefore both D~!%m and Df are symmetric and, as a result, A and B are 


symmetric as well. 


6.4.1 The Algebraic Relations with Coupling 


Let us start by noting that the 4 x 4 matrix that describes the beam ellipse 
of the coupled transverse motion is symmetric and symplectic, which is a 
result of the fact that the motion of the particles is symplectic. From the 
argument above, we have 


Wee Ma = Ícoso D- siny å Ô Tcosy —-D-'sing 
| Xm NJ. \-Dsing feosy ô B/\Dsing  fcosp J’ 


where det A = det B = det D = 1. For a symmetric matrix characterizing the 
four-dimensional beam ellipsoid, the parametrization can be written as 


P Ícosp D~! sing f, Ô Î cosg —-D-!'sing 
~ \—Dsing [cos 0 f, Dsing Icosy 
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where TT = T,, ÎT — T, and DT = D-. Using the relation DT = D-!, we 
obtain immediately 


T 
Ícosg D- l sing Î cosy DT sin Y Î cosg E Tsing 
—Dsing Icosy —Dsing Icosy Dsing Icosy 


As a result, we have 


Tv yee <x 
$- Ícosg =p Tsin p T, 0 Ícosg =f ‘sin 
Dsiny Icosy ô T, Dsing Icosy ? 


which means that the coordinate change that block diagonalizes T in the 
linear form also block diagonalizes T in the bilinear form. The matrices T; 
and T} can be decomposed the same way as eq. (6.6) and we obtain 


2! A a s T A ^ ^ x ^ ^ 
[s 0 \ Ícoso 22 Ícosp yen d Ja 4 
ô AZ )\-Dsing {cosg —Dsing Icosy 0 A, ô Î 
6.19 


ian is wis) 
"n — Oi ,y/ Day 1/ Day l 


Similar to eq. (6.9), this equation shows that the matrix 


5 Ícosp D-!'sing Â, Ô 
A4 = Ad ^ Zz ^ 
—Dsing Icosy 0 A, 


transforms the four-dimensional ellipse into two decoupled circles and the 
transfer matrix can be written as 


2 Í cos (92 p sin 3 A a 0 R, 0 
M, = A 7 ees aves 
—Dəsin y2 Icosye2 0 Aye 0 Ry 


AS 0 Í cos Q1 - D7! sin yy 
0 Ax Di sin Í cos pı ' 


i= ( COS Øx y sin zn 


—sindz,y COS dz,y 


where 
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Similar to the two-dimensional case, we have 
Ícos 2 D sin 3 Aug 0 
es sin p2 Í cos (92 0 Ay2 
m Icosy, D7! siny An 0 Rz" ô 
=M4 A 3 n ES 
—Disino Icosqi 0 yl 0 To 


A 
E Ma Í cos pı pe sinqi Aol 0 Bet 0 
Um N Dy sin Í cos 1 0 Ay 0 R 


MÂ COSQ1 —AD Art sini MD Ay sing TRÀ COSQ1 Br 0 
ô RyUS 


B E COSQ1 -Ñ Di Âr singı MDI! A sinyı +N Àj COSQ1 
Defining 
^s eS mui M12 
M Axı cos 91 — nD Agi sin (Q1 = E bo , 
M21 M22 
we obtain that 
V Pz2 0 eae mii M12 cos Ọr2 — Sin r2 
25| 2 a ' 
—os2/ V Bao. 1/V Duo M21 M22 sin Ọr2  cosdz2 
B E cos $45 + M12 Sin Q3. — 41 sin dz2 + M12 Cos $42 


M21 COS r2 + i22 Sindee — 121 Sin «2 + M22 COS Q2 


(6.20) 


Similar to the two-dimensional case, we obtain the relations 


M12 = == 
tan @z2 = Ame COS Q2 = y m111135 — m13131, 
11 


Ri tmi Ti31T1 + M22M12 

B22 = ee) Of). — -—S See 
™m11™22 — ™12™M21 M11M22 — M12M21 

The relations in the vertical plane are the same. Note that both planes give 

the same tilt angle y2 due to the fact that 


det (An COS Q1 — AD Agi sin 1) = det (br Â, sin y + Ñ Â cos 1) ; 


The four-dimensional case reduces to the two-dimensional case when y2 = 0. 
At this point, the advantage of this procedure based on coordinate transfor- 
mation is rather clear since it avoids the coding of complicated formulas which 
are prone to errors. Instead, it divides the task into a few simple and standard 
steps which are finding the normal coordinates of the initial ellipsoid, tracking 
the transformation matrix to the point of interest and finding the Twiss pa- 
rameters at the point of interest. With the help of the Differential Algebraic 
(DA) technique, it is straightforward to include parameter dependence of the 
Twiss parameters, which can introduce beating due to momentum deviation 
or quadrupole errors. 


Chapter 7 


Imaging Devices 


In the following chapters, we will discuss what specifically has to be done to 
the map of a system to make the system useful for a specific task. In many 
cases, this requires that certain matrix elements vanish, or sometimes assume 
specific values. The most important device is probably the imaging device, 
in which final positions are independent of initial angles, as shown 
schematically in Fig. 7.1. Represented in the language of the transfer map, 
this entails that 
(ra) =0, (yh) =0. 


It is apparent the final angles a; and by are unimportant since it does not 
matter at what angle the rays strike at the image position; so all terms of the 
form (a|...) or (b|...) are insignificant. Additional requirements usually exist 
for the various subclasses of imaging systems. 

'There are many types of devices that form images of charged particles. The 
following are a few types that are widely used. 


7.1 The Cathode Ray Tube (CRT) 


The cathode ray tubes (CRT) are a class of imaging that have seen wide 
use in electronic displays, such as the television tube and the oscilloscope. 
As far as practical use, impact on society, and revenues are concerned, the TV 


FIGURE 7.1: Sketch of an imaging system. 
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tube was until recently the most important application of particle optics. In 
this case, for each color an electron beam is deflected vertically and horizon- 
tally by two simple magnetic deflectors in order to sweep over the screen area, 
and the intensity of each beam is adjusted according to the color saturation 
at the respective point. The cathode ray tubes used in the oscilloscopes use 
electrostatic deflection plates to achieve high frequency. Yet the limit of a sin- 
gle pair of plates is around 150 MHz, above which the single pair is replaced 
with segmented pairs of plates. This type of deflector can reach a frequency of 
300 MHz. To reach even higher frequency, a double helix line is used to deflect 
the beam, which has reached 10 to 20 GHz. Nowadays, 33 GHz oscilloscopes 
are commercially available. 

At any given point on the screen, the resulting spot should not be wider 
than the distance between two pixels, so whatever size the beam had initially 
should not be amplified very much; so 


(rjr) and (yly) 


should not be large. 

The requirements for aberrations are usually benign as the phase space 
volume of the beam is small. Yet those aberrations do limit performance and 
over the decades various ways have been developed to minimize them. The 
advent of plasma and liquid crystal displays (LCD) for TV on one hand and 
digital oscilloscopes on the other has caused a precipitous decline of the use 
of cathode ray tubes, which are preferred now only in specialized markets. 


7.2 The Camera and the Microscope 


'The purpose of a camera and an electron microscope is to create an 
image of an object formed by light or particle rays. The quantities 


(rjr) and (yly) 


are magnifications, and in most cases it is desirable to have them equal. 
'The electron microscope is just a special case in which both of them are made 
to be very large to increase the resolution. 

If a true image is desired, it is important that the relationship between 
final and initial coordinates is really linear, which requires that all higher 
order position dependent matrix elements vanish, and so 


(x|rr) =0, (ylyy)=0, (w|xxx)-—0,.... 


In reality, of course, it is difficult to achieve this to higher orders, hence 
some distortion remains. In the case of an electron microscope, this is often 
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FIGURE 7.2: Curvature of the image. 


not detrimental as one can retroactively correct the effects by calculation and 
the resolution is not affected. The effects that appear usually have the con- 
sequence that rectangles are distorted into either the shape of a pincushion 
or into the shape of a barrel; these effects are due to 


(r|vyy) and (y|yzz), 


which entail that rays that simultaneously have x and y coordinates are either 
pushed out from the center (pincushion) or pulled in (barrel). Higher order 
terms in x and y produce similar effects. 

There should be no effect of energy on position, so 


(z|ó) 2 0 and (y|d) =0 


should be maintained. Similarly, all higher order aberrations involving 6 
should vanish; if this is not the case, some color dependent blurring called 
chromatic aberration may occur, in particular for larger values of x and y, 
an effect that can be easily observed in the case of less expensive binoculars. 

There should also be no effects of position on initial angles to higher order; 
so it is necessary that 


(vlad) = (ylaied) = 0, 


and since the range of accepted angles corresponding to a and b is often 
rather large, to correct these terms is often very important. If any of them 
prevail, they will entail a color independent fuzziness; in case the order of the 
coordinates a and b is even, the fuzz will be oriented toward one side like the 
coma of a comet; if the powers are odd, it will lead to a uniformly distributed 
fuzziness. 
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Similarly, all aberrations involving positions and angles simultaneously have 
to vanish, and hence it is necessary to have 


(z|z'* yab) = (y|a'* y'r ab") = 0; 


if any of them prevail, they will entail a position dependent fuzziness that 
becomes stronger with an increase of the positions x and y. 

Interestingly enough, all higher order aberrations depending on a and b only 
linearly can be corrected by a reshaping of the focal plane; in fact, (x|xa), 
etc., produce a tilt of the image, and (x|xxa), etc., produce a curvature of the 
image. Fig. 7.2 shows how the matrix element (x|rxa) can be corrected by 
shaping the image position parabolically. 

At any given position, due to the matrix element (x|rra), any ray with a 
given a is moved up or down in proportion to a, where the amount of deflec- 
tion depends quadratically on x; so the rays arrive at the x plane as shown. 
However, tracing the rays backwards shows that they in fact all intersect be- 
fore the plane, and the point where this happens depends quadratically on z. 
In similar ways, (x|z*a), etc., can be corrected. 


7.3 Spectrometers and Spectrographs 


Spectrometers and spectrographs are devices for the purpose of measur- 
ing momentum, energy, or mass of charged particles. Momentum spec- 
trometers are mainly used in nuclear physics for the determination of the 
momentum distribution of nuclear reaction products. Most of the momentum 
spectrometers known are magnetic because of the fact that the energies that 
need to be analyzed are too high to allow sufficient deflection by electric fields. 
In addition, magnets have two more advantages. They automatically preserve 
the momentum, and can be built big enough to achieve large acceptance. 

The mass spectrometer is mainly used for the analysis of masses of 
molecules, and they can be operated at much lower energies. They have 
a long history, and their applications pervade many disciplines from physics 
and chemistry to biology, environmental sciences, etc. Mass spectrometers are 
also more diverse; among the major types, there are sector field, quadrupole, 
accelerator, energy loss, time-of-flight, Fourier transform ion cyclotron reso- 
nance and ion trap mass spectrometers. 

For all the different types of spectrometers, the goal is to achieve high 
resolution, and in many cases large acceptance at the same time. As 
resolution improves, the need for better understanding and correction of high 
order aberrations increases. In the following, the linear theory of various types 
of spectrometers will be discussed, followed by the studies of aberrations and 
their correction. 
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FIGURE 7.3: The Browne-Buechner spectrograph. 


As alluded to before, in spectrometers the final position is used as a 
measure of momentum, energy, or mass of the particle. This requires that the 
final position be independent of other quantities, which in particular requires 
that the device be focusing such that 


(xja) = 0. 


Due to Liouville's theorem, it is impossible to obtain focusing and zero image 
size simultaneously, and hence the initial spot width has to be minimized to 
ensure that the final image is narrow enough. Furthermore, the dependence 
on the spectroscopic quantity of interest ó, the so-called dispersion 


(|o), 


should be large. Finally, in a mass spectrometer where particles of different 
energies are present, the dependence on energy (x|ó) should vanish. In the 
map picture, the linear behavior is thus given by the transfer matrix of the 
horizontal motion 


: (rr) 0 — (vl) 
M = | (alz) (ala) (ald) | . (7.1) 
0 0 1 


Let 2D; be the width of the source. From eq. (7.1), it is clear that the par- 
ticles to be detected focus around a spot at (x|ó)ó with a width of |2(x|x) D;]. 
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FIGURE 7.4: Illustration of the imaging condition arising from Barber’s 
rule. 


Hence, the distance between the centers of particles of different energies must 
be larger than the width, i.e., 


((0)0] > |2(a|x) Di]. 


This sets an upper bound for 1/6, which we call the linear resolving power (or 


linear resolution), 
NE _ |_ (ul) 
in 3) 7 neos , 


Hence in order to increase the resolution, it is necessary to increase |(z|ó)| 
and/or decrease |Dj|. 

As an example, let us study the first broad range momentum spectrometer, 
the Browne-Buechner spectrometer. It contains only a homogeneous dipole 
with 90? bending and circular pole boundaries. The layout is depicted in Fig. 
7.3, and it is applicable for particles with energies up to 25 MeV/u. As it 
turns out, there is a simple condition known as Barber's rule that assures 
that the system is x-focusing. This is in fact the case whenever the source 
location, the center of deflection of the magnet, and the image location lie on 
a straight line, as shown in Fig. 7.4. To prove Barber's rule, we first write 
down the transfer matrix of the horizontal plane, which is 


M. - 1 ls cos 0 Rsin 0 11 
T NU —(1/R)sin@ cos 0 1 


7 [o up d (. 2 


— (1/ R) sin cos 0 0 1 
[$9509 — (Io/R)sin@ (l +12) cos0 + (R — lilo/ R) sin0 
i — (1/ R) sin8 cos 0 — (l; / R) sin8 


where the angles 0, a; and ag are shown in Fig. 7.4 and the quantities R, 
lı and lə are the bending radius and the drifts before and after the dipole 
magnet, respectively. Using the relations lı = Rtano;, l2 = Rtanos and 
a, T 032 +0 = 7, as well as tan A + tan B = (1 — tan A tan B) - tan(A + B), 
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TABLE 7.1: The first order map of the Browne-Buechner 
spectrograph (Exponents in the initial variables x, a, y, b, L, ô) 


Xe af Yf br exponents 
-1.000000 -1.950458 0 0 100000 
0 -1.000000 (00 0 010000 
0 0 1.000000 0 001000 
0 0 1.830747 1.000000 000100 
0 0 0 0 000010 
0.519441 0.506574 0 0 000001 


we have 
hl . 
T2 = (L + l2) cos + R-— R sin 0 
= R|(tan o, + tan oa) cos0 + (1 — tano, tan a2) sin 0] 
= R (1 — tano tan o) cos - [tan (ay + 02) + tan 6] 
= R(1 — tano; tano) cosÓ - [tan (m — 0) + tan 6]. 
Because of tan (7 — 0) = — tan 0, we obtain the desired result 


mı2 = 0. 


In general, it is not hard to obtain the first order map by hand, and it is 
very easy to obtain it using a computer code; the result is shown in Table 7.1. 
With the typical assumption that the half width D; is 0.25 mm, the resulting 
linear energy resolution is 


E S 9 


—— — — | = 1000. 


Since all electric and magnetic devices produce nonlinear terms in the map 
called aberrations, their impact on the resolution has to be studied when- 
ever necessary. The nonlinear effects are very important in the case of the 
momentum spectrometers due to their large angular and momentum accep- 
tances. Considering the aberrations, the final width will be a new value Azab 
instead of |(x|z) D;|, which has as an upper bound 


AT = (2|(z|x)Di| + (x|z?)| D? + |(v|xa) D; i| +-+), 
where A; is the half width of the spread in the quantity a. So, the actual 


resolution HR, is 
z|ó 


AXab r 


A parameter often used as a comprehensive quality indicator for a spec- 
trometer is the so-called Q value 


m Q In (Pmax/Pmin) 
= In2 i 
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— 


FIGURE 7.5: Sketch of a generic spectrograph consisting of a single dipole, 
including rays that show the imaging condition and the dispersion of the 
device. 


where f? is the nominal solid angle from which a reasonable resolution is 
expected. The Q-value shows both the geometric and momentum accep- 
tance of a spectrometer. For example, the Q and pmax/Pmin for the Browne- 
Buechner spectrometer are 0.4 msr and 1.5, respectively. Large 2 translates 
into high intensity, which is important for nuclear studies and other situa- 
tions where the number of available particles are small. Large momentum 
acceptance can reduce the number of exposures to cover a certain momentum 
range. 

As discussed before, the purpose of the spectrograph is to translate energy 
information into position information, and in order to have high resolution, 
the position should not depend on anything else if possible. Rays originate 
from a source, travel through the spectrograph, and finally reach the screen, 
as shown in Fig. 7.5. 

It is possible to measure energies in terms of final positions by making the 
dispersion 


(x|6) 


large. In practice this requires the use of at least one bending element, 
because all other elements have vanishing (|). The final position should not 
depend on anything else besides ô, and since it is important to be able to 
accept rays covering a wide range of angles, it is necessary to have 


(x]a) = (y|b) = 0. 


So the spot size is limited by (a|”) = 1/(a|a), which is usually kept small, 
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FIGURE 7.6: Sketch of a generic spectrograph as in the previous picture, 
but now subject to aberrations up to order seven. 


and the size of the object, the x size of which is usually kept in the range of 
fractions of mm. 

Any contribution to the final position should be due to energy, and so aber- 
rations depending on initial angle should be avoided. These aberrations are 
usually called spherical, since they historically first manifested themselves 
from the grinding of lens surfaces as spheres, which is much easier to achieve 
than other shapes. So if possible we want 


(zx|aa) = (zx|aaa) — 0, (a|bb) = (x|abb) = 0. 


These conditions are not satisfied for the simple spectrograph shown in Fig. 
7.5. Rather, when they are considered, the trajectories of the rays look like in 
Fig. 7.6, showing very noticeable broadening of the image due to aberrations. 

The aberrations involving also x-positions are less significant as positions 
are kept small. The ones involving also y positions are more important as y 
is not necessarily kept small; but if (y|y) is kept large enough, particles with 
significant initial y reach the focal plane with significant final y; the interplay 
of (y|y) and (x|yy) then leads to a parabolic shape of the resulting image, but 
the sharpness of the parabola, which determines the resolution, is unaffected 
by (v|yy). 

It is also important to consider aberrations involving energy. Of these, the 
terms depending only on energy of the form 


(z]ó*^) 


do not necessarily have to be corrected as long as they are known, since they 
just turn the relationship of final x and initial ô into a nonlinear one, which 
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FIGURE 7.7: The effect of the aberration (x|aó). 


still allows an accurate measurement of 6. The most important aberrations 
are usually those that involve initial angles and energies simultaneously, as 
both of these can be large. Of these, the lowest order aberration (v|aó) can 
be corrected by a simple tilt of the focal plane: the final x of a particle, 
which depends mostly on 6, is moved up or down linearly depending on the 
value of a. As shown in Fig. 7.7, similar to before, all these rays with different 
values of a go through a common point at a distance before or after the x 
plane, where the effect of (z|aó) does not manifest itself. The tilt of the focal 
plane is also very clearly visible in the actual example of Fig. 7.6. 

In a similar way, spectrographs can also be used to measure masses of 
particles, and all previous arguments remain valid if the energy deviation ó 
is replaced by the mass deviation ôm. If mass resolution is to be achieved to 
very high precision and the initial energy is not uniform, then in addition to 
the above requirements, it is also important that the final position does not 
depend on 6; this requires that 


(0) = 0, 
while of course at the same time trying to have 
(|n) 


large. The simultaneous satisfaction of these conditions is not possible using 
only magnetic devices; for low energies, it is usually achieved by combining 
magnetic and electric deflectors. 


7.3.1 Aberrations and Correction 


For all the spectrographs mentioned above, nonlinear effects have always 
been a concern for the designers. Looking back to the Browne-Buechner spec- 
trograph, the linear energy resolution obtained was ~ 1000. When aberrations 
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FIGURE 7.8: A magnified drawing of the effect of the aberration (x|aó). 
(Az, 21) is the image on the tilted focal plane caused by the term (x|aó). 


are considered, the resolution of the eighth order drops sharply to around 60, 
which is far below the actually achieved resolution. This shows the importance 
of the aberrations. They have to be studied carefully, and the prominent ones 
have to be corrected. 

Since the entrance slit D; is usually small and the solid angle large, only 
the angle and dispersion aberrations are important. Both map calculations 
and geometric considerations show that all the terms (x|z"* a") (m + n even) 
vanish. Since (a|b?) is small (see Table 7.2), the only second order term 
that has a strong impact is (z|aó) which can be as large as 8 mm when a = 
40 mrad and 6 = 20%. In fact, this is the most important factor that causes 
the decrease of the resolution. This becomes apparent when the resolution 
considering (x|aó) is calculated: 


(z|6) 


Rab = 2(r|aó)a;ó 


= 63. 


Fortunately (x|aó) is easy to correct, because it only causes the tilt of the 
focal plane, which is illustrated in Fig. 7.8. 

To prove the last statement, suppose a particle of energy Ko(1 + ô) starts 
from the origin with slope ao and goes through an angle focusing system. The 
final position and angle to first order are 


m = (m) a1 = (ala)ag + (a|5)5, 
respectively. Taking into account (a|ad), the result becomes 
Tı = (x|) + (z|aó)aoó, G1 = (a|a)ao + (a|ó)0. 


Consequently, the system is not focusing anymore. Now consider a second 
particle of the same energy starting from the same point but with a different 
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TABLE 7.2: Aberrations of the Browne-Buechner 
spectrograph at the straight focal plane that are 10 jum or 
larger (Parameters: £maz = 0.23 mm, amaz = 40 mrad, 
Ymax = 1mm, 6544 = 10 mrad, maxz = 2096. Exponents: 
in the initial variables x,a, y, b, 1,6) 


# Coefficient Order Exponents 

1 -0.2300000000000000e-3 1 10 00 00 
2 0.1038881145421565 1 00 00 01 
3 0.4660477149345817e-4 2 10 00 01 
4 0.8311049163372523e-2 2 01 00 01 
5 -0.5127000000000000e-4 2 00 02 00 
6 -0.1025577239401090e-1 2 00 00 02 
7 -0.6562560000000000e-4 3 03 00 00 
8 0.3324419665349009e-3 3 02 00 01 
9 -0.1873001321765092e-2 3 01 00 02 
10 0.1038881145421565e-4 3 00 02 01 
11 0.1544932687715805e-2 3 00 00 03 
12 0.4654187531488613e-4 4 03 00 01 
13 -0.1338622665642800e-3 4 02 00 02 
14 0.4380445064894873e-3 4 01 00 03 
15 -0.2577160791614789e-3 4 00 00 04 
16 0.4622130002260186e-4 5 02 00 03 
17 -0.9970354551245065e-4 5 01 00 04 
18 0.4511716453676560e-4 5 00 00 05 
19 0.2219821460952441e-4 6 01 00 05 


angle ao + Aao. The differences in final position and angle between the two 
particles are 


Ax, = (r|aó)A^agó, Aa, = (ala)Aao. 


The fact that Azi/Aa, is independent of Aap indicates that particles of 
energy Ko(1 + ô) are focusing at 


which is proportional to ô. So the tilting angle is 


Az  (m|aó) 
ny m = (aep 


where w is the angle between the normal to the focal plane and the z-axis. 
Furthermore, the correction of (r|aó) even increases the resolution under 
certain circumstances. When Azap is smaller than the detector resolution 


Axq, Avg becomes the limitation of the momentum resolution and is inde- 
pendent of v. Since the distance between two peaks increases by a factor of 
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TABLE 7.3:  Aberrations of the Browne-Buechner 
spectrograph at the tilted focal plane that are 10 wm or 
larger (Parameters: £maz = 0.23 mm, amaz = 40 mrad, 
Ymax = | mm, 65,44 = 10 mrad, maxz = 20%. Exponents: 
in the initial variables x,a, y, b, 0, ô) 


Coefficient Order Exponents 
-0.2300000000000000e-3 1 00 
0.1038881145421565 1 
-0.9320954298691634e-4 2 
-0.5127000000000000e-4 2 
0.1079501821087351e-1 2 
-0.6562560000000000e-4 3 
0.3324419665349009e-3 3 
-0.2105079060488440e-3 3 
-0.1038881145421566e-4 3 
.1654199766297046e-2 3 
-0.1329767866139604e-4 4 
0.5138469075870278e-4 4 
-0.2242022047923136e-4 4 
0.1745846601755523e-3 4 
0.1305877080464639e-4 5 
0.2771149616039620e-4 5 


HG HG n| HG pÀ|bHGmDo 
O Oi! 4 Q MN /Í O (o 0 -H Oo 0 A wN e|% 
o 

OOooooooooooooceoce 
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OOooOoooooooooooooo 
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1/cosw while Avg remains unchanged, the resolution is 


which is greater than the linear resolution. Rigorous computation of the actual 
resolution requires that the aberrations on the tilted focal plane be calculated. 

For the Browne-Buechner spectrograph, eighth order maps of both straight 
and tilted focal planes are computed. Table 7.2 shows the aberrations on the 
straight focal plane, where (x|a9) is clearly the major contributor. Table 7.3 
contains the aberrations on the tilted focal plane, where (r|aó) vanishes and 
others are either reduced or unchanged. Both tables are taken from [5]. The 
resolution after the cancellation of (x|aó) bounces back to 780 (or 1560 in 
momentum), which is quite close to the linear resolution. This entails that 
the remaining aberrations are weak enough to be ignored at the time when it 
was first designed. 

When higher resolution is required, more aberrations have to be corrected. 
Usually (x|a?) and (x|b?) are corrected first. Then (z|a?), (z|a?6) and (2|ab?) 
are tackled. If necessary, fourth order terms like (x|a*) also have to be min- 
imized. At least in one instance, eighth order terms, such as (r|a?b?), are 
corrected, too. This is done in Quad-Dipole-Quad (QDQ) spectrographs. The 
pole faces of the last quadrupole are shaped to produce quadrupole, sextupole, 
octupole and decapole field components. Generally, corrections are done by 
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placing magnetic multipoles of the same order into the system. They are ei- 
ther separate adjustable multipoles or combined fixed elements to dipoles or 
quadrupoles. 


7.3.2 Energy Loss On-Line Isotope Separators 


As part of the growing developments in the study of radioactive beams, on— 
line isotope separation is more and more widely performed. As in other 
mass spectrometers described above, different isotopes have to be laterally 
separated. Yet they can no longer be bent by electrostatic sectors anymore 
due to their high energy. Among the different methods, the energy loss method 
is a very interesting one. First, particles of the right rigidity are selected by 
a slit at the dispersive focal point. Second, the selected particles are sent 
through an energy degrader which creates new momentum spread according 
to the mass of the particles. And finally, a second slit picks up the desired 
nuclei. The best spatial separation can be achieved when the whole beamline 
is achromatic and the degrader preserves the achromaticity. This is due to 
the fact that nuclei of the same mass but different momentum are focused at 
the same spot. Hence it is important to study the transfer map of an energy 
degrader and the achromatic conditions. 

In most of the cases, the degrader is thin enough for us to neglect the 
straggling effect from multiple scattering. So a degrader is the combination 
of a drift and an energy loss, which has the first order matrix 


1 d 0 
Ma = 0 1 0 
(J|x)a (0]a)a (l)a 


Here d is the thickness of the degrader. It is easy to show that the spatial 
part of M4 can be reduced to a unity matrix. After applying a negative drift 
behind the degrader, Ma becomes 


. 1d 1 d 0 1 0 0 
Ma={0 10 o 1 o0 |=| 0 1 o 
0 0 1/ \Gle)a (ó]a)a (616)a (d|x)a (3]a)a (813)a 


For heavy ions at the intermediate energy region of > 10 MeV /u, the energy 
at the exit can be described with the formula 


1/y 
Ka=K (1-2) ; 
R 


where K is the energy at the entrance of the degrader and R is the range. 
Furthermore, 


R = kA TK YZ, 
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FIGURE 7.9: Layout of an example of a fragment separator. There is 
an intermediate image at the mirror symmetry plane in the middle, and the 
system is achromatic. 


where A is the atomic number, Z is the charge, and k and y are constants. 
Within this model, the matrix elements can be obtained as 


E Od 
Olea = cd RD or 
(dla)a = 0, 


(5|6)a : 


— I= dis 


where dg and Ro are values for the reference particle. 
To achieve achromaticity, the system has to satisfy 


(zla) = 0 and (z|) = 0. 


By denoting the parts before and after the degrader with subscript 1 and 2, 
respectively, the achromatic conditions are obtained and they are 


(x|x)2(2]a)1 * (z|a)2(a]a)1 +(x|6)2{(4|@)a(zla)1  (9|a)a(a]a)1) = 0, 
(x|2)2(2|d)1 + (|a) (a]0)1 + (]6)2{(4|x)a(w|d)1 + (01a)a(a|0)1 +(6|4)a} = 0. 
When the previous model applies, (ó|a) vanishes. Together with the require- 


ment that both parts are focusing, that is (z|a)1 = (x|a)2 = 0, the conditions 
can be reduced to 


(zļa)ı = (v|a)a = 0, 
Dı Mə + Do(Di(6|x)a + (6|6)a) = 0, (7.2) 
where D = (z|ó) and M = (a|z). 


An example which uses an achromatic degrader for isotope separation is 
shown in Fig. 7.9. When operated on the achromatic mode, the system is 
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mirror symmetric about the degrader and forms a dispersive image at that 
point. The matrices of the two parts are 


! M 0 D ] 1/M, 0 -Dj/M 
M4 = (ala) 1/Mi 0 ; M» = (a|z)1 M4 — (a|z)1Di1 
0 0 1 0 0 1 


From eq. (7.2), the achromatic condition for the fragment separator is 
Dy, (d|x)a + (6]6)a = 1. 


Therefore the shape of the degrader can be decided, which is a wedge with 
the slope 

Od = ydo 

Ox E Dı ` 


It is straightforward to verify that the system with the degrader is indeed 
achromatic 


(0|9)a 0 0 
M = Mo- Ma- Mı = | My(alx),(2—(6|x)gD1) 1 0 
M,(5\a)a 01 


7.4 *Electron Microscopes and Their Correction 


The field of electron optics is one of the oldest branches of beam physics, 
which is a direct descendant of light optics. Recently, it is also one of the most 
active branches due to the advancement of aberration correction in electron 
microscopes. In the past decade, the TEAM project (Transmission Electron 
Aberration-corrected Microscope) has been developing the next generation of 
electron microscopes. Initial experiments using the latest aberration-corrected 
scanning transmission electron microscope (STEM) demonstrated the scien- 
tific potential of aberration-corrected electron microscopes. 

Since their invention in the early 1930s, electron microscopes have been used 
in various areas ranging from scientific research to industrial production, and 
various different types of microscopes were developed for the specific needs 
of those applications. The main variants are the transmission electron micro- 
scope (TEM), the scanning transmission electron microscope (STEM), the 
photoemission electron microscope (PEEM), the low energy electron micro- 
scope (LEEM) and the scanning electron microscope (SEM). 

Among them, TEM and STEM are used mainly to study the bulk proper- 
ties of materials with electron energy ranges from 100 keV to 1 MeV. PEEM, 
LEEM and SEM are used to study surface properties of materials with elec- 
tron energy below 30 keV. In a PEEM, secondary electrons generated by 
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AK/K > 0 
AK/K =0 
AK/K <0 


FIGURE 7.10: Spherical (top) and chromatic (bottom) aberrations. A 
Gaussian image exists at the dashed line. Higher and lower energy electrons 
are represented by AK/K » 0 and AK/K < 0. 


photons are imaged. In a LEEM, electrons reflected from the sample surface 
are imaged. In a SEM, an electron probe the size of a few angstroms is formed 
on the sample and secondary electrons are collected. 

Except for LEEM, which needs a magnetic separator to separate the in- 
coming and reflected electron beams, most microscopes without aberration 
correction consist of so-called round lenses only. There are two types of 
round lenses used in electron microscopes: the electrostatic and the magnetic 
lenses. Electrostatic lenses are used in PEEMs, LEEMs and some SEMs, 
whereas magnetic lenses are used in TEM and STEM where the higher 
energies of the electrons make the use of electrostatic lenses impractical. 

The rotational symmetry of these lenses ensures that only a small number 
of aberrations remain to degrade the linear or, so-called, Gaussian image 
properties. The first kind are the spherical aberrations, which lead to 
a blurring of the image due to the opening angle of the electron beam at 
the object. In the electrostatic case, the first relevant terms are (x,aaa) 
and (y, bbb), which are equal due to the rotational symmetry, and usually 
denoted by Cs. There are also terms of the form (z, a?) and (y, 0?) which 
are denoted by Cs, the significance of which is discussed below. Second and 
fourth order terms and cross terms like (x, aab), etc., vanish because of the 
mirror symmetry. 

In the magnetic case the situation is a bit more complicated since magnetic 
round lenses can rotate the image in the x — y plane. However, considering 
the motion in the rotated coordinate system, it can be seen that the matter 
reduces to quite the same situation as in the electrostatic case. The top picture 
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of Fig. 7.10 illustrates the effect of the spherical aberration. 

The second kind of aberrations are the chromatic aberrations, which 
arise as a combination of the opening angle and the energy spread of the 
beam. The lowest order chromatic aberration for a round lens is 2, and in 
the electrostatic case has the form (z,aó), which because of symmetry also 
equals (y, bd). These chromatic aberrations are usually denoted by Cc. In the 
magnetic case after transformation into the appropriate rotated coordinate 
system, the situation is again the same. The effect of the chromatic aberration 
is illustrated in the bottom picture of Fig. 7.10. 

Even in the early days of electron microscopes, the possibility of correcting 
the remaining aberrations had been contemplated. Yet the initial result of 
theoretical investigation was not very encouraging. Scherzer [62] showed that, 
for around lens without reflection, the spherical and the chromatic aberrations 
do not change sign, the same as the focusing force of such a lens (see Section 
4.4.1). 

Specifically, electrons with a larger angle are focused stronger and electrons 
with higher energy are focused weaker. As a result, aberration correction 
requires violation of the above assumptions, through using either multipole 
elements, electron mirrors or time varying fields. Early attempts on aberra- 
tion correction, between the late 1940s and the early 1990s, failed mainly due 
to insurmountable technical difficulties. Hence the development of electron 
microscopes up to the early 1990s follows mainly the line of aberration re- 
duction through optimization of the lens design and improvement of stability. 
'The initial success of aberration correction came when the technology was 
ready in the mid-1990s [59, 39]. 


7.4.1 Aberration Correction in SEM, STEM and TEM 


The first successful aberration correction was reported in 1995, where the 
spherical aberration C's and chromatic aberration Co were corrected in a low 
voltage SEM (scanning electron microscope). The corrector consists of four 
multipole elements (see Fig. 7.11), which was originally proposed in the early 
1960s. The two outer elements are electrostatic multipoles and the two inner 
ones are superimposed electrostatic and magnetic multipoles. The corrector 
consists of two identical quadrupole doublets, where the two quadrupoles 
are physically the same, excited at the same current but with opposite polarity. 

Furthermore, it is arranged such that the so-called cosine-like ray of the 
horizontal plane, which in conventional transfer map terminology corresponds 
to the (z, x) matrix element, goes through the center of the left inner element, 
while that of the vertical plane goes through the center of the right inner 
element. This entails that (r|aó) and (y|bd) can be corrected independently 
from each other. 

More importantly, rays in the vertical plane coincide with those in the 
horizontal plane going backwards. This layout minimizes the breaking of ro- 
tational symmetry due to the introduction of multipoles. The most noticeable 
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FIGURE 7.11: Cosine-like rays (left) and sine-like rays (right) of the 
quadruplet corrector. 


FIGURE 7.12: A quadrupole-octupole C's corrector. The rectangles rep- 
resent quadrupoles and the hexagons represent octupoles. 


consequence is that the terms (x|a9) and (y|bó) are equal, restoring the rota- 
tional symmetry of the chromatic aberration. The superimposed electrostatic 
and magnetic quadrupoles form first order Wien filters that can correct 
chromatic aberration. 

In other words, the different energy dependencies of the electrostatic and the 
magnetic forces allow adjusting the chromatic aberration while maintaining 
overall linear focusing. In addition, the rotational symmetry of the spherical 
aberration is partially restored. Specifically, for a rotational symmetric sys- 
tem, we have (z|a?) = (2|ab?) = (y|a?b) = (y|b?). For the present corrector, 
the relations among the four terms are (x|a?) = (y|b?) and (a|ab”) = (y|a?). 
'These relations show that two families of octupoles are needed to correct the 
spherical aberration using a corrector with the same symmetry. 

For this corrector, the octupole components of the inner multipoles correct 
the terms (r|a?) and (y|b?), and those of the outer ones correct (z|ab?) and 
(y|a?b). With Cs and Cc corrected, the resolution of a 1 keV SEM could 
reach below 2 nm. 

Meanwhile, another scheme was developed and used to successfully correct 
third order spherical aberration in a 100 keV S'TEM. where a certain combi- 
nation of quadrupoles and octupoles is used. It uses a similar layout for the 
quadrupoles as the Cs and Co corrector above, which is shown in Fig. 7.12. 
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The linear optics consists of two identical quadrupole doublets with equal 
spacing between the quadrupoles and equal strength of all quadrupoles. The 
two outer octupoles correct the terms (x|a?) and (y|b?), and the middle one 
corrects (z|ab?) and (y|a?b). Due to the large difference in transverse position 
of the horizontal and vertical rays in the outer octupoles, the two knobs are 
mostly orthogonal. A resolution of 0.78 A has been achieved using such a 
corrector. 

While the introduction of the C's corrector into an electron microscope 
corrects the third order spherical aberration, it also generates much larger 
fifth order spherical aberrations (C5) through the combination of the 
objective lens and the octupoles and that among the octupoles, which becomes 
the limiting factor as the resolution reaches toward 0.5 À. The equation below 
illustrates the origin of C5 through combination. 


C) = oue) oer clea)? Lectt) 
E Ce zi + (ala) 2) 


(a|z) x; + (ala) a; 


(z|a) koa} 
i PT korx? + kon ((2|2) x; + (ala) aj + (xla) — 


., (el i Gla) a; 
(ala) zi + (ala) a 


a ( (x\a) korz} 
((ala) kor + ko? (zlay*) x? 4-3 (z|z)? (zla) kor kone? J ` 


Since Cs is proportional to (x|a), it vanishes when (r|a) vanishes, i.e., when 
the first element (right) is imaged onto the second one (left). It can be seen 
from Fig. 7.12 that this condition is not met for this corrector. More recent 
designs of Cs correctors have taken this into account and correct C5 as well. 
By adjusting the image location, the value of C5 can be varied and canceled. 

In order to correct Cs in a transmission electron microscope (TEM), extra 
attention has to be paid to maintaining a large usable object area, the so-called 
field of view. This usually requires that at least 2000 image points are well 
resolved in one dimension. It turns out that the simple quadrupole-octupole 
corrector shown in Figs. 7.11 and 7.12 does not meet this requirement. The 
main reason is that the cosine-like ray of the objective lens, i.e., the sine-like 
ray of the corrector, goes through the octupoles at large amplitude. As a 
result, it is deflected by the octupoles, generating large aberrations that limit 
the field of view. 

The first successful corrector Ca on TEM consists of two round lenses and 
two sextupoles, which is shown in Fig. 7.13. The round lenses form a so- 
called —I transport between the centers of the sextupoles, i.e., the linear 
transfer matrix is a negative identity. It turns out that this cancels the second 
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FIGURE 7.13: A sextupole Cs corrector. The ellipses represent round 
lenses and the rectangles represent sextupoles. 


FIGURE 7.14: Sine-like and cosine-like rays of a Cs corrected TEM from 
objective lens to the end of the corrector section. OL: objective lens, Lı to 
L4 : round lenses, Hı and Ho : sextupoles. 


order aberrations generated by the sextupoles as well as C5 from combination. 
Furthermore, the third order spherical aberration can be corrected due to the 
fact that Cs from the sextupoles, which is proportional to k?, is rotationally 
symmetric and of the opposite sign of that of the round lenses. 

Fig. 7.14 shows such a corrector in a TEM, together with the objective lens 
and the so-called transfer lenses. Note that the cosine-like ray of the objective 
lens goes through the centers of the sextupoles, hence it is unaffected by the 
corrector, helping to maintain the field of view. A slightly modified version of 
such a corrector has also been used to correct Cs in STEM. Recently, a STEM 
named TEAM 0.5 (Transmission Electron Aberration-corrected Microscope) 
has achieved the resolution of 0.5 À at 300 keV using such a corrector. 

With the success of correcting the spherical aberration in TEM, scientists 
and engineers in this field have set out to build a TEM that is both Cs and 
Co corrected. Successful as it is, the sextupole corrector is not capable of 
correcting Cc and it is not obvious how to modify the sextupole corrector to 
include Cc correction. As a result, attention has been focused on the option of 
a quadrupole-octupole corrector. After many attempts, Rose [59] developed 
a design which satisfied the requirement and was later adopted by the TEAM 


182 An Introduction to Beam Physics 


FIGURE 7.15: Sine-like and cosine-like rays of the TEAM corrector. El- 
lipses: round transfer lenses, rectangles: multipoles. The focal length of the 
middle elements is half of that of the far outer ones. The ratio of the sine-like 
rays at the middle elements is 5. 


FIGURE 7.16: Sine-like and cosine-like rays of the TEAM microscope from 
objective lens (OL) to the end of the corrector section. The omitted part in 
the middle is the multipole corrector shown in Fig. 7.15. Lı and Lə : adapter 
lenses, Oz : octupole used to cancel (x|ab?) and (y|a?b). 


project and built. 

As shown in Fig. 7.15, the corrector consists of two multipole quintuplets, 
each replacing one sextupole in the sextupole corrector (Fig. 7.13). The mid- 
dle element of each quintuplet is a superimposed electrostatic and magnetic 
multipole which is responsible for correcting the spherical and the chromatic 
aberrations. Each quintuplet is mirror symmetric about its center and each 
half is again mirror symmetric about its own center. Each half of the quin- 
tuplet is point to parallel and parallel to point. Each quintuplet is a —I 
transport. The result is the cancellation of a large number of aberrations. 
Since one of the strengths of quadrupole families is a free parameter, it is 
chosen such that the relative difference in the horizontal and vertical beam 
width at the center of the quintuplet is large. As in the case of the sextupole 
corrector for TEM (Fig. 7.14), the cosine-like ray of the objective lens is not 
affected by the aberration corrector (see Fig. 7.16). The second family of 
octupole can be placed either at the center of the corrector or, as shown in 
Fig. 7.16, after the corrector. 
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FIGURE 7.17: Spherical (top) and chromatic (bottom) aberrations of an 
electron mirror, showing the possibility of reversing the sign of that of a regular 
round lens. Vertical dashed line: Gaussian image. Higher and lower energy 
electrons are represented by AK/K > 0 and AK/K < 0, respectively. 


Such a corrector posed unprecedented challenges on technology in terms 
of tolerance on alignment errors and power supply stability. The required 
tolerance on alignment error is around 14 um between adjacent elements, 
which is difficult but achievable. For the superimposed multipole elements 
which are responsible for aberration correction, the noise level of the current 
and voltage supplies have to be below 1.5-1078 (AI/|I|) and 4-1078 (AU/|U |), 
respectively. This level of stability was unheard of even a few years ago. Yet 
recently, it has been possible to achieve AI/|I| = 8.1- 107? and AU/|U| = 
3.6 - 107°, fulfilling the design criteria. A first test of the corrector showed 
that the resolution of a TEM with this corrector reached 1 A. 


7.4.2 Aberration Correction in PEEM and LEEM 


Due to the low energy of the electron beam (« 30 keV) in these devices, 
so-called electrostatic lenses that use electric fields to focus the beam are 
feasible. Although the multipole corrector used in low voltage SEM (scanning 
electron microscope) successfully corrected the spherical and the chromatic 
aberrations, it is not suited for PEEM (photoemission electron microscope) 
or LEEM (low energy electron microscope) which requires large field of view. 

A sophisticated multipole corrector similar to the TEAM (Transmission 
Electron Aberration-corrected Microscope) corrector may be sufficient, but 
as it turns out there is a much simpler alternative, namely the electrostatic 
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FIGURE 7.18: Geometry of the tetrode mirror in PEEM3 at Lawrence 
Berkeley National Laboratory, California, USA. 


FIGURE 7.19: Layout of PEEM3. Square: beam separator, ellipses: elec- 
trostatic round lenses, the mirror is on the bottom. 


mirror. The reflection in the mirror makes it possible for a mirror to generate 
spherical and chromatic aberrations with the opposite sign of those from the 
regular round lenses. 

The top picture in Fig. 7.17 shows that an electron with large initial angle 
is reflected at a location where the slope of the field line is smaller than the 
initial angle and can be focused less. The bottom picture shows that an 
electron with higher energy penetrates deeper into the mirror, is reflected at 
a location where the slope of the field line is larger than that for an electron 
with design energy and, as a result, can be focused more. 

Therefore, an electron mirror with a dent on the reflection electrode com- 
parable to the electron beam size can form the desired field distribution for 
aberration correction. Fig. 7.18 shows an example in PEEMS3 at Lawrence 
Berkeley National Laboratory, California, USA, which is an adaptation of the 
SMART design, and the layout of PEEMS3 is shown in Fig. 7.19. SMART 
is a project of SpectroMicroscopy for All Relevant Techniques in Germany. 
The dots behind the surface denote charge rings used for numerical simula- 
tion. The first electrode from the right physically ends roughly at z — 33 mm. 
There are four electrodes used to provide tuning of the focal length, the spher- 
ical, and the chromatic aberrations. 
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FIGURE 7.20: The beam separator of the first aberration-corrected 
PEEM. The path of the reference electron is shown by the dash-dotted curve. 
(From H. Rose, Geometrical Charged-Particle Optics, Springer- Verlag, Berlin, 
2nd ed., 2012, © Springer-Verlag Berlin Heidelberg 2009, 2012 [60]. Chap- 
ter 9, Correction of Aberrations, Fig. 9.12, “Cross section of (a) the fourth 
quarter of the beam separator showing the double symmetry of the fields and 
the curved optic axis and of (b) the entire separator. The shaded areas repre- 
sent the regions of the dipole field perpendicular to the pole plates. The sign 
and the strength of the dipole field differ for regions with different shading; 
the dash-dotted curve represents the optic axis." With kind permission from 
Springer Science and Business Media.) 


Although the electron mirror itself maintains the rotational symmetry, a 
magnetic beam separator is needed to guide the electron beam to the detector 
downstream of the mirror, thus breaking the rotational symmetry of a con- 
ventional PEEM. Consequently, the most challenging part of an aberration- 
corrected PEEM/LEEM is the beam separator whose own aberrations have 
to be small compared to the existing ones. 

The first aberration-corrected PEEM was built at Technische Universitat 
Darmstadt, Germany, in the 1990s and was installed at BESSY II, the Berliner 
Elektronenspeicherring-Gesellschaft für Synchrotronstrahlung in 2001. Re- 
cently it achieved a resolution of 3 nm. 

Its layout is similar to that shown in Fig. 7.19 up to the exit of the beam 
separator since the former has an energy filter downstream of the beam sepa- 
rator. The mirror column forms a so-called —J transport, which ensures that 
the cosine-like ray turns back on the axis and is unaffected, maintaining a 
large field of view. The beam separator shown in Fig. 7.20 is a square magnet 
with 90? bending and three axes of mirror symmetry (0 — 27.5?, 45? and 
62.5? for each pass). The resulting optical system is an achromat with +I 
transport, i.e., its transfer matrix is an identity matrix, and it is free of all 
second order geometrical aberrations. 

The drawback of this separator is the difficulty in building this device to 
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FIGURE 7.21: The PEEM3 beam separator. The square, the ellipses and 
the rectangles represent the magnet, the electrostatic round lenses and the 
electrostatic quadrupoles, respectively. 


FIGURE 7.22: Sine-like, cosine-like and dispersive rays of the PEEM3 
beam separator in the x-z plane (left), and in the y-z plane (right). 


the tight machining tolerance and in tuning it during operation due to the 
complexity and rigidness of the design. The fact that focusing is produced 
primarily by the edges entails that the slope of the grooves and the details of 
the field near the electron path are critical to the quality of the image. 

The selection of high permeability material to fit the field distribution to 
the analytical model leads to magnetic material which is very soft and thus 
difficult to machine. As a result, the second project of an aberration-corrected 
PEEM, built at Lawrence Berkeley National Laboratory, turned to a simpler 
separator design shown in Fig. 7.21, and the rays are shown in Fig. 7.22. 
Since the magnet is a simple 90? sector bend, round lenses, with the help 
of electrostatic quadrupoles, provide the focusing. There is only one axis of 
mirror (0 = 45?) for each pass. The system is a so-called —I transport for 
each pass with no zero dispersion at the end. An achromat is formed after 
two passes. 
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FIGURE 7.23: Aberration-corrected and energy-filtered LEEM at IBM. 
Images of the sample are formed on the diagonal lines of the prisms, which are 
shown by squares. (Reprinted from Ultramicroscopy, v. 110, R. M. Tromp, et. 
al., A new aberration-corrected, energy-filtered LEEM / PEEM instrument. 
I. Principles and design, p. 852-861, Copyright (2010), with permission from 
Elsevier [68].) 


Between 2007 and 2011, a third aberration-corrected PEEM/LEEM was 
designed and built at IBM [68]). The layout is shown in Fig. 7.23. Magnetic 
prisms are represented by squares, and electrostatic and magnetic round lenses 
are represented by ellipses. Fig. 7.23 also shows cosine-like rays (drawn in dark 
gray, denoted “field ray") and sine-like rays (drawn in pale gray, denoted “axial 
ray") from the sample. The beam separator consists of two 90? prism arrays 
and an electrostatic round lens. It restores the double mirror symmetry of 
the first separator but greatly simplifies the design and manufacture by using 
only commercially available components. The prism behaves like a round lens 
to the first order and is mirror symmetric, which entails that the dispersive 
ray forms a virtual image at the center. As a result, one round lens between 
the two prisms is sufficient to make the separator an achromat and transfer 
the image from the center of the first prism to that of the second one. 


Chapter 8 


The Periodic Transport 


In the case of the periodic transport over long distances, the desire is not 
so much to give a special shape to the beam as the beam exits, but, even 
much more simply, to just contain the beam. This is of key importance 
in all devices in which the beam repeatedly passes through the same (or a 
very similar) structure. We may wonder whether this again translates into 
the requirement that a certain matrix element vanish, but as we shall see, this 
is not quite the case. 

Actually it is rather straightforward to formulate a necessary condition on 
the linear matrix: it is not allowed to have any eigenvalue of magnitude greater 
than unity. If the eigenvalue is real, the argument is simple: if this were the 
case, any particle that has its coordinates lined up with the corresponding 
real eigenvector will after one period end up on the same line, but all its 
coordinates would have increased by a factor equal to the eigenvalue. 

If on the other hand the eigenvalue is complex, there is another eigenvalue 
that is conjugate and hence has the same magnitude. Similar to the eigenval- 
ues, also the eigenvectors are conjugates of each other. Now simply consider 
the sum of the two eigenvectors, which is real; sending this sum through the 
matrix multiplies the first eigenvector by the first eigenvalue, and the second 
one by the conjugate, resulting in a sum that is again real and increased in 
size by the magnitudes of the eigenvalues. 

In both cases, coordinates grow exponentially in time, and so eigenval- 
ues that are even only a tiny amount above unity in magnitude are detrimen- 
tal. Of course the nonlinear effects also influence the motion and break the 
purely exponential pattern, but all experience shows that it is not possible 
to correct linear instability with nonlinear means; in practice, things usually 
work quite to the contrary. 


8.1 The Transversal Motion 
8.1.1 The Eigenvalues 


Because of emittance preservation due to Liouville’s theorem, the fact that 
eigenvalues greater than unity are prohibited means that, in fact, all eigen- 
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FIGURE 8.1: Motion in phase space (left) and in the eigenspace (right) 
when |tr M| » 2. 


values have to have unit magnitude. Of these, the cases +1 and —1 are 
to be excluded too, since even the slightest imperfection in the machine may 
otherwise lead to instability. Altogether, in a periodic system, the eigenvalues 
must all be complex and of unit magnitude. 

It is particularly interesting to study the special case of a matrix with 
midplane symmetry. In this case, the x and y motion decouple and can be 
described by individual matrices. We obtain for the eigenvalues for the 2 x 2 
x sub-matrix, noting that the y sub-matrix is treated similarly: 


(rr) -À (ala) 
(alz) | (aja) = ^ 
= (z|x) (ala) — (ala) (alz) — A [(x|x) + (ala)] +”, 


o- [A -a| = 


and so 


Hence to have complex eigenvalues requires the very simple condition 
—2< tr <2. 


A quick check of the four cases shows that this excludes the point-to-point 
case and the parallel-to-parallel case, as in both of these, the trace just equals 
two or exceeds two. The parallel-to-point or point-to-parallel case each have 
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FIGURE 8.2: Relation between the phase space variables and the eigen- 
vectors. 


one element on the diagonal vanish, so they are permissible if the remaining 
diagonal matrix element is less than two in magnitude. 

We also verify that for tr M # 2, the eigenvalues form a reciprocal pair, 
i.e., AVAg = 1. Let us quickly revisit the case | tr M| > 2, for which the 
eigenvalues are real, and hence one of them is greater than unity, and as we 
had already concluded, the motion is unstable. Choosing a new basis, the 
so-called normal form basis, along the real eigenvectors vı and U2, we have 
that the repetitive motion asymptotically approaches the eigenvector vı with 
eigenvalue greater than unity and becomes larger and larger (see the right 
picture in Fig. 8.1). 

A detailed analysis shows that the motion indeed follows a hyperbola; 
note that A;A2 = 1, and that |À1| > 1 > |A2|. Suppose we have a general 
vector expressed in the basis (71,02) as shown in Fig. 8.2 whose coordinates 
are now o and f (not to be confused with the Twiss parameter), and thus 


E E MR z 
= (Z) = a5 + 50. 


Applying the transfer matrix, we have 
Mz = oM, + BMU = aM Tı + Brod». 
In normal form coordinates, the action of the transfer map is thus given by 
(a) = (o X) (3) 
Ba] NO 22] NBJ" 
but since Ap = 1/A1, the product of the coordinates stays constant, character- 


istic of the motion along a hyperbola. In Cartesian coordinates, the motion 
looks more complicated as the hyperbolic structure is deformed (see the left 
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FIGURE 8.3: Relation between the eigenvalues when | tr M| « 2. 


picture in Fig. 8.1). For practical purposes, this case is unstable and hence 
useless. 

Let us now consider the case |tr M| < 2 in more detail. We have the 
complex eigenvalues that satisfy 


Ag = Xi and A= AE 


So in the complex plane, 1 and A» lie on a circle and form conjugate pairs, 
as shown in Fig. 8.3. 
'The eigenvalues can hence be written as 


— acis 
M2 =e , 


where p is called the tune of the system, which is 


(= + &) (sz) 
jt = arccos — = arccos 2 . 


The eigenvectors U;,2 belonging to A1,» also form conjugate pairs, since 


Miz = Müz = Aot = A13. 


Define now two new basis vectors y} = Re (v1), V. = Im(v1) as the real 
and imaginary parts of the eigenvalues; they define what is called the normal 
form basis for stable motion. So we have 


We now observe 


MV, =, = e (U, + W_)= cosu-V, —sinp-U.. +i (sin u- U} + cosu: U), 
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FIGURE 8.4: Motion in phase space (left) and in the eigenspace (right) 
when |tr M| < 2. 


and similarly 
M; = à202 = e~™ (U, — dU .) = cos pU} —sin p-U. —i (sin u- V4 + cos p- U). 


Now assume we have a general vector expressed in the basis vectors v4. with 
coefficients a and f, i.e., X = avi + BU... Then we have 


Mz = oM, + BM. 

„të jV, — Mi, + Mos ,M-HM 
1a gy a a 2, g% Mi 
2i 2 2i 

a (cos u - U4 — sings: 0.) + B (sinu: U4 + cos p- v_) 


II 


(acos u + B sin u) V4. + (~a sin u + B cos u) 9... 


So altogether, in normal form coordinates, we have 


^ b) 7 ( 21 _ ( COS Li an (3) 
M = : = ; 
B —asin u + B cos u — siny Cosh BJ 
and thus the transformation M simply performs a rotation as shown in the 
right picture in Fig. 8.4. 

The angle of the rotation in normal form coordinates is simply equal to the 
tune p; and it is completely obvious that the motion is stable. 

To obtain the motion in the original Cartesian coordinates, we have to 
subject the circles to a linear transformation, which turns them into ellipses; 
so the motion looks as in the left picture in Fig. 8.4. The angle by which 
particles move in the original x, a coordinates is not necessarily u anymore; but 
we can conclude that indeed if we look at the average angle advance over many 
turns, then this average converges to the tune p, as at least the number of full 
revolutions that were experienced must agree in both coordinate systems. 
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FIGURE 8.5: Possible movement of the eigenvalues under small perturba- 
tion near | tr M| = 2. 


It is also very illuminating to consider what happens if the system is sub- 
jected to some small errors, which in reality of course always appear. If the 
eigenvalues were far enough from unity, even under small errors we still have 
ài = Ao and Ai = Ne and while the tune u may have changed a little, the 
qualitative behavior of stability is totally unaffected. So as long as we main- 
tain that | tr M/2| < 1 is maintained, stability prevails. If on the other hand 
the perturbation is so large that this is violated, the perturbation can lead to 
the loss of stability as shown in Fig. 8.5. 


For the sake of completeness, let us consider further the case of | tr M | = 2. 
In this case, 41,9 = 1, and 


M — f. 


This motion is stable; but under the slightest perturbation there is danger of 
becoming unstable, and hence this case is practically useless. 


8.1.2 The Invariant Ellipse 


For many practical purposes it is particularly important to know in detail 
the parameters of the ellipse that is invariant under stable linear motion. For 
this purpose, let A1,.2 = e*'", and choose the sign of the tune u such that 
sign(u) = sign((x|a)). We then define three parameters o;, 3; and »y; as 


(z|a) — (ala) 


xla a|x 
Qi = 7 , Bi = ( | , | 
2 sin uj sin Li 
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As we shall prove now, these three parameters describe the invariant ellipse 


vila 
x T Yi Qi T 
6) Ga) (eo 


where the matrix describing the ellipse is called Î. To prove that T is actually 
invariant, we first express the transfer matrix in terms of the parameters. To 
this end, we observe that since 


tr M 
A1,2 = 3 ae 


we have that 
(ala) + (ala) = tr M = X + Aa = e" +e = 2 cos p. 
From the definition of a;, we have (|x) — (ala) = 2sin pi - a;, and hence 
(z|z) = cos ui + aisin ui, (ala) = cos pi — a; sin pi. 
On the other hand, from the definitions of 8; and yi, we have 
(r|a) = Bisinpi, (alz) = —% sin pi, 
and so altogether 


yy = ( cosi + a, Sin ij Dj sin pi 
P —^yi sin Hi COS Hi — ou sin pu J ` 


Letting 


we have " E 
M = I cos u; + K sin pi. 


Computing the inverse map of M , we find 
M! = I cos p; — K sin pu, 


where we used |M| = 1, and as a consequence piyi — a? = 1, which we infer 
as follows: 


1 = |M| = (cos ui + a; sin pi) (cos pi — a; sin pi) + Biyi sin? pi 
= cos? pi + (Bii — o2) sin? pi = 1 + (—1 + biyi — o2) sin? pi, 


but since u; was not allowed to be zero or m because of our requirement of 
stability, we must have iyi — a? = 1. 
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We now are ready to study whether indeed the ellipse defined above is 
invariant under M. This is the case if whenever a particle satisfies the ellipse 


equation 
T 
£ [1i Qi ] x —1 
OR ee. 


their image under MI, which is given by 


also satisfies the ellipse equation. This means that also 


T 
p] GA) Be Qr 
a Qi Di a 
This is the case if and only if 
MT.T.M-T, 


since every ellipse is described by a unique symmetric matrix and M? - T - M 
is indeed symmetric. In order to execute the matrix multiplications necessary, 
we study various matrix products; let 


We then have 


Now we are ready to compute the product MT . T - M. We obtain 
MT.T.M- (cosi; + RT sin ui) T (cos p + R sin pu ) 
= (i COS Ui + KT sin mi) (7 COS Lj + J sin pi) 
= T cos? p; + J sin pi cos p; — J sin pr; cos ui + T sin? p; = T, 


which is indeed what we needed to prove. To conclude we remark that there 
is not only one invariant ellipse, but even every ellipse that can be generated 
by stretching or shrinking from the original one is invariant. So altogether, we 
have a nested set of invariant ellipses, and particles will always stay contained 
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FIGURE 8.6: Stable motion in phase space. 


on the invariant ellipse on which they are originally lying, as shown in Fig. 
8.6. 

'The last important question remaining in this section is to put into per- 
spective the parameters of the beam a, 8, y and the parameters o;, Bi, i 
describing the invariant ellipse of one turn accelerator. Are these Greek let- 
ters equal, are they related, or do they have nothing to do with each other? 
This is actually a question that often throws off even die-hard accelerator 
physicists, and it is very much worthwhile to understand it in depth. 

Regarding their origin, these two sets of parameters are actually totally 
independent. In fact, one describes some property of an accelerator, and 
the other describes a property of a beam; and of course we can feed any type 
of beam into a given accelerator. 

However, if the goal is to fill the accelerator in the most efficient way, as 
it turns out this is accomplished if the beam's Twiss parameters agree with 
those of the accelerator. In this case, after one revolution the phase space will 
occupy exactly the same area (although the individual particles in it are at 
different positions), as shown in Fig. 8.7. 

On the other hand, if one injects a beam with an ellipse that does not agree 
with the invariant ellipse of the accelerator, then the repetitive behavior of 
the beam ellipse shown solid in Fig. 8.8 is determined by the shaded invariant 
ellipse it touches. 

As we go around the repetitive system repeatedly, the beam ellipse stays 
within the invariant ellipse and touches it, but, depending on the tune, will 
have a different orientation. In fact, if the tune is not rational — something 
desirable for stability reasons — over time even all different orientations will 
occur. If we now want to operate the accelerator, we have to make sure we can 
handle everything inside the invariant ellipse, leading to considerable waste 
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a 


FIGURE 8.7: Illustration of the case where the beam ellipse (dashed) and 
the invariant ellipses (solid) are matched. After each revolution, the beam 
ellipse is exactly reproduced. 


of area. 


So it is best to operate a repetitive system in such a way that the beam 
ellipse is matched to the accelerator’s invariant ellipse, and to avoid mis- 
matching, the so-called beating. 


8.2 Dispersive Effects 
8.2.1 The Periodic Solution 


Let M be the transfer matrix of a periodic cell, 


The periodic solution characterized by D, D’ satisfies 


D (x|x) (ala) (2d) D 
D' | =| (alz) (ala) (als) | | D' |. (8.2) 
1 0 0 1 1 
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a I 


FIGURE 8.8: Behavior of a mismatched beam. 


Thus 
(a) - gn E e , (8.3) 
(D-E C98) (0) 

when 


det (' = ehm 205) = 2— (zjz) — (aja) £0. 


Note that when tr M « 2, which is satisfied if only stable motion is considered, 
D, D' are uniquely determined. 


(>) "zw (Mls) Eea) (in) 
= I (1 — (a]a)) (|) + (zla) (alô) 
2 — (ala) — (ala) (G — (z|æ))(alð) + (aso ) l 


)4 
)4 


From the point of view of the computation, a slightly different form of the 
same result helps to develop an algorithm that can be applied to arbitrary 
order with the help of the Differential Algebraic (DA) technique. Eq. (8.2) 
can be rewritten as 


100 D 0 (x|z) (ala) (x|ó) D 
010 D'|-[0]|-2|(a|x) (ala) (ald) D'|, 
000 1 1 0 0 1 1 
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and hence 
D (z|z)-1 (æla) (x|5)N ^ (0 
D' |= (ajx) (ala) — 1 (alo) 0 
1 0 0 1 1 
Defining 


we obtain the more compact form 


D een f0 
D! = (M — in) 0 
1 1 


It is straightforward to generalize the above equation to the case of a nonlinear 
map. Here the equation to be solved is 


D (6) D (3) 
D'(3) | - Me | D'(5) J, 
ô ô 


where D (6) and D’ (6) are polynomials of 6 without the constant part. Similar 
to the case for the linear map, we obtain 


D (6) D (6) 
D' (8) | =(M-Tx) o | D'(à) |. 
ô ô 
where 
100 x 
Ty =1010 a 
001 ô 


As described in Section 5.4.2, the map (M — Zy )™* can be obtained using the 
Differential Algebraic (DA) technique order-by-order up to any given order. 
'Thus the periodic solution of dispersion up to arbitrary order can be obtained 
without loss of accuracy. 


8.2.2  Chromaticity 


Now let us turn our attention to the betatron tune of off-momentum par- 
ticles. We know that higher momentum particles are bent less, hence focal 
length of quadrupoles is longer. As a result, we expect that total phase ad- 
vance decreases as momentum increases. Recall that 

pid9B 
p Ox 
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and when p — po 4- dp, we have 


q By q OB, 1 
= ——— = — Oe — Z> k 1-6 , 
p Ox po Ox 1+ dp/po 1 Ko ( ) 
fp = 
po Ox Po 


For a small distance ds, we have 


1 0 
ait = (gas 1) 
j 1 0 
War ee i 


a avro f| cosluo) + o sin(uo) B sin(uo) 
M = dMdMg ^ ( sint i COS( Ho) — uy) 


i ( — (k ET ds d) ae a yo 
= ( 1 i) eee + asin(uo) Bsin(uo) ) l 


dkods 1 —ysin(uo) cos(uo) — a sin( uo) 


And by defining 


we have 


Using the normalization transformation 


d E TT anea 90. 0 
a/VB VB -ajVB iB)" 


we obtain the transfer matrix in the normalized space as 


D 0) 4-1. 4 fcosluin) + esin(uio) sini) — yg 
( ) Pig d : —ysin( uo) cos(uo) — un) A 


A 
= e 1) e sin en 
~ VóBkods 1 —sin(uo) cos(uo) 


cos( uo) sin(uo) 
óBkods cos(uo) — sin(uo)  cos(uo) + óBkods sin(10) 


1 x 1 
cos(u) = 5 tr (sr) = cos( uo) + 29 Bkods sin(/10), 
1 
L = poo + du => cos(u) = cos(uo) — du sin( uo) => du = —39Bkods 


= dv y- =—7 6 Bkods = £n = JT PLE 
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The last step keeps only the leading order effect. The quantity £w is called 
natural chromaticity. 

It will likely lead to beam loss since certain off-energy particles lie on reso- 
nant tunes. To remedy this problem, it is common to use sextupoles, because 
they do not affect tunes of on-momentum particles but at the same time 
provide quadratic nonlinearity that can be used to compensate the natural 
chromaticity. The magnetic field of a sextupole is 


By = b2 (2? — y^) , By = 2boxy, 


where b2 = —3 M3 3. Defining ks = —b2/xmo and denoting 


COS [be +QzSiN Hy Bx Sin Hg 0 0 
^ —^ys Sin [Ly COS [ly — Os Sn Hy 0 0 
M — . . , 
0 0 COS [ly + QySiN [ly By sin Hy 
0 0 —Yy SiN fy COS fly — Ay SIN [ly 
we obtain 
X1 X To 
Pe 
ay | _ a+ keds (a - 4?) o | ar | % 
yı y Yo 
bi b — 2k,dsxy bo 
x (cos He +QzSiN Hg) zo + (Bx sin Hz) ao 
a+ kads (a? — y?) — (Ye sin i) Lo + (cos Hg — AzSiN Hz) ao 
[9] 
y (cos Hy +aysin fy) yo + (By sin py) bo 
b — 2k,dsxy — (Yy sin Ly) yo + (cos fy — o sin fy) bo 


Again, to make the physical picture clearer, let us apply the normalization 
transformation in four-dimensional space, which is 


x l/V/B, 0 

a Ox] V/ Bs V Bx 
y y 0 0 1//B, 0 
b b 0 O cog Bu. x Bg 


0 0 
0 0 


c c ag 8 


Vis 0 0 0 
A-} E —aüs/ A Br 1/48. 0 0 
VB, 0 
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Denoting the linear normalization transformation (as opposed to the trans- 
formation matrix), its inverse and the linear transfer map as 


£ x £ 
A-ÁA|^|, At*=A7/°], m=M]|°], 
y y y 
b b b 
we obtain 
Tı x 
a ksds (x? — y? 
Hy = |Ao oS s(x v) oA"! o[Ae Mo A71] 
yı y 
bi b — 2k,dsxy 
x 
a+ ksdsy Be Bux? — B y’ 
= : i ) x R(Hz, hy), 
b — 2ksdsy Ds byzy 
where 
Rime) 0 5 
Hz a ^ cosu sin 
Rus, Hy) = | p ) y , R (us) = eo n : 
R (py) b 


Since the kick of a sextupole is second order in the coordinates, the off- 
momentum particle has to go through it off the magnetic center in order 
to affect the linear motion and hence the tune. In other words, the dispersion 
has to be nonzero at the location of the sextupole to correct chromaticity. 
With the presence of dispersion, the coordinates are 


r—r—D;,ó, a—a-— D,, 
and the normalized coordinates are 


fo P D; Es ES. 1 
T>T-— i à — à — y B,D,ó. 
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The one turn map in the new normalized coordinates is 


Tı £+ 67?” Deb T god. DÀ 
4 i 
@ |_| a+pipie |a kds (Baa? — Bu?) | | a— aipis 
n y y y 
E i 
by b b — 2ksdsBe Byxy b 
z+ bz? D,ó z— pz? D,ô\ (o 
L ay lj ü 
o| a+ Bz D,6 oR(us, ly) a— br Dz |, ae 
y y Yo 
b b bo 
HH To 
2 a 
a+ ksdsv/Bs [Bz (x — Daô/ vB)" — byv” ag 
y 2 R(x, Hy) © Yo 


b — 2ksdsy/ Buby (x — Dô / / Bs) y bo 


The reason that the constant part vanishes after the rotation is that D, and 
D, are periodic solutions of the ring. Keeping only the linear part, we obtain 
the transfer matrix 


M —28kds| “PePe 
0 
and the chromaticities due to the sextupole 
1 
Eres = yp Bs (s) Ds (s)ks(s)ds, 
T 
1 
Pss ELE (s)ds. 
In summary the total chromaticities are 
1 
& =~ fist es) — Di) (3) ds, (84) 
1 
& = =: ds Dus) Ds) ()] d (8.5) 
where k,(s) and k,(s) are quadrupole strength in the x and y planes along 
the ring. Usually two families of sextupoles are used to correct chromaticities 
in both planes. In order to make the two knobs more efficient and orthogonal, 


one family is placed at locations so that 5, is large and 8, is small and the 
other family at locations so that the opposite is true. 
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Eqs. (8.4) and (8.5) are very useful for the design process to determine 
where the chromaticities are generated and where the best locations are to 
place the sextupoles for correction. Yet computing them together with the 
higher order terms become almost trivial with the Differential Algebraic (DA) 
technique. Recall that the tunes are given by 


1 tr M, y 
Vr y = on arccos EX MM . 


The 6 dependent tunes are simply 


Vax y (0) = x arccos (0) : 


T 


which contain the tunes and the chromaticities to arbitrary order. For exam- 
ple, vz, are the constant part and £;,, are the linear coefficients. 


8.3 A Glimpse at Nonlinear Effects 


Linear motion around a fixed point is completely classified by the two cases 
we discussed previously, namely the stable or unstable case. This situation 
is fundamentally different in the nonlinear case; it is in fact much 
more complicated and interesting, and represents one example of the modern 
research field dealing with just such questions. 

While this is not at all the place to try to develop a complete understanding 
of the nonlinear effects that may appear, let us spend some time to stake the 
territory and make some general observations. First we may expect that as 
long as the motion is close enough to the fixed point, it is dominated by 
linear effects, and depending on whether we have stability or not, we see 
either stable elliptic motion or unstable hyperbolic motion. While we may 
expect that linearly unstable motion will in most cases also stay unstable if 
we consider the nonlinear effects, linear stable motion will not usually 
stay nonlinearly stable. In fact, if the amplitudes of the motion become 
large, the effects of nonlinearity will become noticeable over-proportionally, 
and eventually they will become dominating, in most cases leading to insta- 
bility for large amplitudes. 

One can then try to heuristically separate the phase space into a region that 
appears stable for a reasonable number of turns, and a region that appears 
unstable. According to the previous arguments, in most cases the stable 
region will be near the fixed point, and the unstable region will be away from 
the fixed point. The region of transition between the apparently stable and 
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apparently unstable parts is usually called the dynamic aperture, and it 
often looks like a deformed ellipse. 

Let us now study a little what conditions seem to favor stable or unstable 
motion, respectively. If we divide the phase space regions into parts in which 
the nonlinear effects have a tendency to pull particles away from the origin 
and those that tend to push the particles toward the origin, then we may 
expect that we want to avoid situations where the particles spend too much 
time in the “pull away” regions, and it is better if we sample the phase space 
uniformly, and thus average out the effects as much as possible. 

A nearly uniform sampling of the phase space happens if the linear tune is 
not a rational multiple of 27. On the other hand, if the tune is of the form 
Li = 27p/q, after q turns the particle will come back to where it was before 
and hence can see the same effect, a situation which we call resonance; so 
it is at least not a good idea to choose q too small, as repetition after large 
numbers of turns is not as critical. The effect of resonances in a circular 
accelerator is of great importance to its performance. Chapter 11 is dedicated 
to studying this topic in detail. 

We may also wonder to what extent it is possible to perform a transfor- 
mation to normal form coordinates in a similar manner as in the linear case. 
As it turns out, most systems cannot be brought to a normal form in which 
the motion is exactly circular; the existence of such a transformation is tanta- 
mount to the system being integrable, i.e., having one integral of motion per 
phase space dimension. Truly integrable systems, however, are very rare. It 
turns out, however, there is a powerful order-by-order iterative procedure to 
turn a system into nonlinear normal form up to any given order [5]. A simple 
example of this procedure is given in Section 11.4. 


Chapter 9 


Lattice Modules 


In the design of actual devices for the manipulation of beams, it is impor- 
tant to employ field arrangements that achieve the basic features of steering 
the beam as a whole to its desired location, as well as keeping the beam 
close together over possibly extended distances, which is achieved through 
various focusing mechanisms. Thus in both single pass lines and rings, there 
exist different sections that perform different functions which require differ- 
ent types of lattice modules. Modern accelerators and beam transport lines 
focus the beam transversely using alternate-gradient focusing, also called 
strong focusing, which evolves from the so-called weak focusing used in 
betatrons and early weak focusing synchrotrons. In the weak focusing ma- 
chines, inhomogeneous dipoles were used to bend and transversely confine the 
beam simultaneously. This is possible due to the fact that an inhomogeneous 
dipole with 0 < n < 1 focuses the beam in both x and y planes. 
From eq. (4.6), we have, for 0 < n < 1, 


ies cos( /1 — nó) (p/ 1 — n) sin(V1— no) 

D — (V1—n/p) sin(/1— n9) cos(/1 = nó) 
ma ( cos(/no) (p/n) 2 

=| (ym/gsin(/Ró) ^ co nó) j 


Yet weak focusing was eventually replaced by strong focusing because weak 
focusing was too weak to confine the high energy beam. Here is an example 
of the Tevatron at Fermi National Accelerator Laboratory (Fermilab, FNAL), 
Illinois, USA. 


Bo c4 T, E= 10° GeV, P = 10? GeV/c. 
P Pc E 10? x 10? 


er Boq L qBoc qBoc | 4-3x 108 E 
For n — 1/2, we have, 

ƏB |) Bo 14 V 

y) n2 uere eRe gee T 

dz |p ~ 2800 i yn 


NB ete = 2-5 x 1073. 5 x 107? = 1.25 x 1074 T. 
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FIGURE 9.1: Sketch of a FODO cell without bending magnets. 


Yet for quadrupoles, B|,-, ~ 1-3 T. Thus we conclude that weak focusing 
at the 1 TeV of the Tevatron is four orders of magnitude weaker than strong 
focusing. 


9.1 The FODO Cell 


The most common form of strong focusing module is the so-called FODO 
cell, where two quadrupoles of opposite polarity (focusing (F) and defocus- 
ing (D)) are separated by drifts or homogeneous dipole magnets (O). Due 
to it simplicity, we can easily derive the transfer matrix of such a FODO 
cell. To simplify matters even further, we choose the center of the defocusing 
quadrupole as the start and the end of the cell and use the thin lens model of 
the quadrupole. First, let us consider a FODO cell without bending magnets, 
as shown in Fig. 9.1. The total transfer matrix of the horizontal plane is 


ne m aIG Gus JG 1) (ver J 
= [oia D0 t) an Denn 0 DG) 


7 1— d/2fi d 14+ d/2f»5 d 

— M/2f5-1/2fi—- dAfife 1+ d/2fo)\l/2fe—-1/2fi- dAfife 1-d/2fi 

= 1—d/fitd/fo—@/2fi fa 2d (1— d/2f1) 

AV fa AJ fa- d/ fifa d/2f3- PJAf f2. 1— d] fit d/ fa— @/2fi fo) 
Hence we obtain the transverse tunes, which are 


cos (us) = 1 oy 2 cos ( E s d 

P hof fife "y fe fe 2Rhf 
Note the second equation above is obtained through interchanging fı and f2. 
The shaded area in Fig. 9.2 shows the range of fı and fo where motion in 
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.d. 
2f 


FIGURE 9.2: The “necktie” diagram showing the stability region of a 
FODO cell. 


igr B QD B IQF 
FIGURE 9.3: Sketch of a FODO cell with bending magnets. 


both planes are stable, i.e., |1 — d/ fı -- d/ f; — d?/2 f f2| € 1, |1-- d/ fı —d/ fa— 
d? /2 f, f2| < 1. Because of the shape of the region of stability, this and related 
similar figures are often referred to as a necktie diagram. 

When f = fi = fo, we have 


d 
cos (Ha) = cos (uy) = 1— 2p 
and the condition for stable motion is 
d? d? d? d 
1- — | < 1 = -1 < 1 - — < 1 = 0 < — < 2 — f > —. 
| 2P] $ E LL E EU 


From the geometrical point of view, when f « d/2, the particle that is parallel 
to the optical axis at the center of the defocusing quadrupole at the start of 
the cell would cross the axis before the center of the defocusing quadrupole at 
the end and bend further away from the axis. The transfer matrix for f — d/2 


is —I, so the maximum phase advance of a cell is 7. 
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Next, let us consider a FODO cell with bending magnets, as shown in 
Fig. 9.3. Here we assume that the drifts between magnets are negligible. 
Furthermore, the bending angle 0 is small. Since l is nearly a constant, the 
transfer matrix of the dipole is 


F cos 0 psinÜ p(1-— cos0) 1 l 10/2 
M = | -(1/p)sin0 cos sin g ~{0 1 6 |, 
0 0 1 00 1 


where | = p is the arc length of the dipole and 0/p — l/p? « f {l<2f,p > 
f 

Note that the factor (1 + no) / (2 + 70) does not appear due to the fact that 
dp/po is used. At an energy that 7o > 1, (1 4- 5o) / (2 -- po) S 1, and the 
difference between dp/po and d.K/ Ko becomes negligible. The matrix of the 
cell is 


1 00 1 l 16/2 1 0 0 1 l 16/2 1 00 


M,=|-1/2f 1 0|[/0 1 9 1/f 1 O}f 0 1 6 ||-y2f 1 0 
0 SE PO 0 1| 1/N0 0 1 0 11 
15g"  2A(1+1/2f) 210 (1+1/4f) 

=| —l/2f?+P/4f? 1-0P/2f?? 20(1-1/4f — /8f?) |, (9.1) 
0 0 1 


and hence we obtain 

RUN RE dne: 21 [1 $ sin(u4 /2)] 
2f? sin(js) 

Az = 0 => 0, = 0, focusing at the ends => 6, = bemar: 


cos(u;) = 1 — 


Note that when QF and QD (see Fig. 9.3) have the same strength, there is 
symmetry between the two planes. The transfer matrix of the vertical plane 
can be obtained through changing the sign of the focusing, i.e., (f— > —f). 
Furthermore, the transfer matrix of the cell that starts and ends at the centers 
of the defocusing quadrupoles can be obtained the same way. The results are 
summarized below. 


2l [1 — sin(u/2 
Hy 5 Hs, Bymax = Bemaxr Pymin = Bamin = Z 
Note that l/f determines p, and l determines 8. and 6,,,, when y is fixed. 
With the upper limit of the magnetic field B of the dipole, which give the 
upper limit of 0, thus the lower limit of the number of cells, the lower limit of 
the size of a ring can be obtained. (The upper limit of B is roughly 1.5 T for 
warm (normal conducting) magnets and 8 T for superconducting ones.) 


min 
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Similarly, we have 
l0 [1 + (1/2) sin(u/2)] 


sm (u2 — 
DL-0, Db5-0, 


16 [1 — (1/2) sin(u/2)] 


Dax = ; 
sin? (11/2) 


Dyin md 


where the subscripts F and D denote the centers of the focusing and defocus- 
ing quadrupoles, respectively. When u = 90°, we have 


l 
f= V2’ 

Geo (2+ v2) Bain = (2- v2) l, L— — 3-22 ~ 5.8, 
pcs ; (4 T, v2) 10, Dmin = ; (4 = v2) 10. 


As an example, let us consider the Main Injector FODO cell at Fermilab, 
which has the parameters 


l = 17.2886 m, p= 


N| 


2 


min 


Bos (2+ v2) ~ 59.0 m, be = (2- v2) te 10.1 m, 
and which is shown in Fig. 9.4 [33, 21]. The magnetic field and length at the 
momentum of 8.9 GeV/c are 


B=0.102T, 1,—6.006m, x4-29.60 Tm, p= 2 — 291 m, 


2 
0= cae) = 41.9 mrad (2 magnets per half cell), 
p 


Bos I (4 f v3) 10 =1.96 m, Din = I (4 E v2) 19 = 0.94 m. 
It is worth noting that the values of a and Dmax are very close to those 
obtained from the exact model, which are 58.2 m and 1.95 m, respectively. 
This shows that the thin lens model is rather accurate for a typical FODO 
cell. 

Now let us look at the size of the beam. The Fermilab Main Injector 
is designed to accelerate proton and anti-proton beams of emittance up to 
407 mm mrad. Following the Fermilab convention, the emittance is defined as 
EN = berms bY, (B = v/c, y = 1/1 — 82), where erms is the rms area of phase 
space occupied by matched beam. The quantity ey is called the normalized 
emittance. The factor 8y makes ew a constant through acceleration — note 


that ey X €rms By ox Boy (a2) (a2) — (xa)? ox 4/ (a2) (p2) — (xps). The factor 
6 means that the size is 60 ~ 2.450, which contains ~ 90% particles. As a 
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FIGURE 9.4: Lattice functions of a FODO cell at the Fermilab Main In- 
jector. 


result, we obtain 


Dax EN 
Tmax = Ymax = ~ By ` 


At injection, with po = 8.9 GeV/c, we have y = 9.54, 8 = 0.994, so we 
have max = Ymax ~ 16 mm. For the horizontal beam size, we have to take 
into account the momentum spread. We assume óp/po ~ 0.3% and have 
tp = Dqaxóp/po = 6 mm. As a result the total horizontal beam size is 
vla. = V/XAax t 1h ~ 17mm. So at injection, the full beam is about 34 mm 
wide. At extraction, the momentum is 150 GeV/c, 6 ~ 1 and y = 160, so 
we have max = Ymax = 4 mm. Since óp/po scales with y, the momentum 
spread becomes 0.02% and xp = 0.4 mm. Thus, we have zT y ~ 4 mm, and 
the full beam at extraction is 8 mm wide. This is called adiabatic damping. 
With acceleration, all phase space variables scale the same way. As a result, 
the shape of a bunch does not change. It is illuminating to compare radiation 
damping and adiabatic damping. In the case of radiation damping, po remains 
constant while py, p, and ôK decrease. (The radiated energy is recovered by 
the radio frequency (RF) cavity.) During adiabatic damping, pz, p, and ôK 
remain unchanged, while po increases. Both dipole and quadrupole magnets 
have to be ramped to keep the design closed orbit and tunes constant. The 
stronger force results in a smaller beam. 
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Finally, let us look at chromaticities for FODO cells. 


1 = 1 Bmax Bmin 
E= -E fotos = o (Aas - Aan) 
1N 2l 
x (1 + sin(Z) eost sin(£)) Tm 
.  Na4sin(p/2),. HM N die st tan(u/2) ie Nu 
cO ung cce gO gal ch MECESO 


For the Fermilab Main Injector, 


; du =26.425 => £, = —33.6, 
, Vy =25.415 = &, = —324, 
which is not far from the exact values (£, = —33.6 and £, = —33.9) showing 


again the usefulness of the thin lens model. Without correction, and assuming 
the momentum spread of +1%, we have 


A 

Av, = £j E = +33.6 x 0.01 = 0.336, 
Po 
A 

Avy = £j 2E = £32.4 x 0.01 = 40.324. 
Po 


Clearly, without chromaticity correction, the momentum acceptance of the 
ring would be very small (probably below 0.1% due to the fact that the dis- 
tance between v, and the half-integer is only 0.075). To correct chromatic- 
ity, we place two sextupoles in each FODO cell, one next to the focusing 
quadrupole and the other next to the defocusing quadrupole. Using the thin 
lens model of the sextupoles and ignoring the distance between the sextupoles 
and their adjacent quadrupoles, we obtain the total chromaticities from eqs. 
(8.4) and (8.5) 


1 
& = = $ (s)[kx(s) - Da (S). (9) ds 
1 


=e = (+ = Dass) + Bmin (- - Danks )| ; 
& =~ f2) a(s) DG) (9) d 


1 1 1 
S Ge min | 77 maxka max | F Dminks , 
ie [an (=F D + Boas (F + Dankon )| 


where ksp and ksp are integrated strengths of the sextupoles next to the 
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focusing and defocusing quadrupoles, respectively. When 


Pia M (71^ a 
BM FDmax — 229 [ + sin(u]2)/2] 
ond loc. sin (u/2) 


fDmin  2f°0 [1 — sin(u/2)/2]' 


the chromaticities are corrected. Note that the relation | = 2f sin(u/2) is 
used to obtain the above expressions. 


9.1.1 The FODO Cell Based Achromat 


Achromats are needed because dispersion free straight sections are needed 
in both circular accelerators and beam transport lines. Achromatic sections 
are also required in circular machines, where, for example, straight sections 
that house injection and extraction kickers are dispersion free to make the 
beam small. The straight section where RF cavities are located is also dis- 
persion free. Passing a RF cavity with r-ó correlation produces coupling 
between transverse and longitudinal motion, which is usually undesirable. 
In the case of beam transport lines, achromatic conditions have to be met 
when the matching requirement is such that the line is imaging or the line is 
isochronous. 

'There are mainly two types of achromats, those that utilize repetitive sym- 
metry and those that use mirror symmetry. Let us consider a system that 
consists of n identical cells. 


: (zl) (xa) (xð) Rd 
M = | (ala) (ala) (alà) | = 
0 1 0 1 
BR m. R (R+Îd 


which is an achromat. 
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For a stable FODO cell, which is what we are interested in, we can write 
the matrix R explicitly, which is 


R= cos u + asin u DB sin 
"m —*ysinu cos—asinp 


In the normalized space, 


B= APRA“ = cosu sinp 
—sinjs cosp 


where 


S d/ VB 
d= Ad = 
mom 
with d = (x|ó) and d’ = (a|). As a result, we have 


A os (k in (ku) 

Xm cB 
n( 
s( 


Sa _ =| cos (ky) sin "y 


— sin (ku) cos 
(etke ae gu /2 (etke - g M /2i 
— (ee — RM /2i (eth + go /2 


Il 
> 
ll | 
o 
4 7X 


7 (A+ B)/2 (A- B)/2i 
|». A-(A- B)/2i (A+ B) /2 
where : 
iet M | m einn Tz ERE 
= ikp uo —ikn _ 
e 2 1 — ein" AS ve Loe 


It is obvious that when u = (m/n)27, we have A = 0 and B = 0, which leads 
to the achromatic condition. 

Achromats of this kind have been used as the arcs of storage rings, beam- 
lines and spectrometers. Examples are the 90° arcs of the South Hall Ring 
at the MIT-Bates Linear Accelerator Center at Massachusetts Institute of 
Technology, Massachusetts, USA, the 180° arcs of the storage ring at the 
Duke Free Electron Laser Laboratory (DFELL) at Duke University, North 
Carolina, USA, and the arcs of the ILC (International Linear Collider) Beam 
Delivery System at SLAC National Accelerator Laboratory, California, USA. 
There was a time-of-flight spectrometer built at Los Alamos National Labo- 
ratory, New Mexico, USA, that consists of four identical cells with u = 90°. 


216 An Introduction to Beam Physics 


It turns out that this kind of system not only cancels dispersion, but also 
cancels all second order geometrical aberrations. Recalling the equations of 
motion (3.22), second order geometrical aberrations generated in the short 
interval [s,s + ds] can be written as 


cy = ri + hdszxiaj, ay = ai > TT. um sttolyr oj, 
Uf = yi + hdszibi, by = bi + Tok imnafalyr by, 
where the summation > is taken over k,1,m,n from 0 to 2 such that k +1 + 


m +n = 2; so in the above, 
2 


Xo reads as 5 


k,l, m,n=0 
k+l+m+n=2 


This simplified description is used in the rest of this section unless otherwise 
noted. Here the linear matrix from s to s+ds has been removed by the inverse 
matrix. As a result, the second order map over the interval is lumped into a 
point. Applying the transformation 


x x l//B. 0 0 
a Ox M br V Be 0 
y y 0 0 1//By 0 
b 0 Ogle By By 


0 
0 


i 
Il 
m 


c cc gs 8 


we obtain 
qf v] / Bs. x + hdsxa V Pss 
dg| | (oum--85a)/VBs | fat 3 Ta im nz haly"b"| | x8: +4i)/VBr 
jr wVBÀ, [| y + hdszb i Bj 
by ak buds b+ YT sim nta y" b^ (ayi +b:)/ By. 
+ (hds/ / B.) & Ti (Arti + à;) 
a; + ET, kirn, aX alg min 
J: (VBzhds/8,) s (—oyg +8) 


bi + Y. k,Lm,n; kalgmpn 


where T equa and E NR are linear combinations T, i 1,5,» and Tp k,l ,m,n, 
each of which is multiplied by some powers of Bs, by, o, and/or oy. Now let 
us consider a system that consists of n identical cells with the phase advances 
ple = ply = 2nfn. 

Defining 


I 


coc 8 R 
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the sum of all the n kicks over the whole system is 


Er (hds//Bz) x (—a,x +a) 


ae » R ( (n — m) Hz -$ e YT, k Im nvr aly™b” 

Up | Nm-nmn - by (VBrhds/By) x (—ayy + b) 

by YT, kim nr aly™h” 

TI T Pa 
i (n T 4 ) 
(hds/A/B.) x (asx + a) 
E Yn ( 2mm /n d A Y kim nz aynb" 
m=0 -2mmn[n — by (V/Bchds/ By) x (—ayy + b) 


etme oreo” 


The last step uses the fact that nur = nuy = 27. Every term in the above 
expression can be written in the form of 


Saw (+) (ts) 


m=0 


where | = 0,1,2,3 and j,k = {x,y}. Using the relations 


id ~id id _ pid 
cos = 4, sing = >, 
we have 
3-1 2mm 
5 cos! | 22 4 $j | sin + ok 
m=0 
= n-1 ei(2mn/n+¢;) ME eg (Qm /nó;) l gi(2 mm [n ó;) = eg (Qm /nó;) 3-1 
7 2 2i 
m=0 


Dropping the common parts in each sum, which are functions of $4, and $,, 
there are only four kinds of sums 


n-1 in6r/n n—-1 —in6r/n 

X gi6mm /n = l~e f —0 ) eg (mnn 2 l—e / =0 
A T ción /n NA m Lt e-i6n/n UU 

m=0 m=0 

n-1 1 gin2n /n n-1 is eg in2n [n 


3 gm n _ AE = 0, 5 g- mmn _ T m 0, 
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when n Æ 1,3. In conclusion, for a system that consists of n identical cells 
(n > 1, n Z 3) and py = py = 27/n, all second order geometrical aberrations 
vanish. Note that this is true even for systems without midplane symmetry. 
When coupling is present, the second order kicks for x and y will become more 
complicated but remain a polynomial of the second order. The transformation 
to the normalized coordinates A will be coupled as well, yet the general form 
of the kicks in the normalized space remains unchanged. Therefore the same 
proof holds. 


Another result of such a system is that some chromatic aberrations are 
canceled. Of all the remaining chromatic terms, only two are independent. 
With two families of sextupoles, all second order chromatic aberrations can 
be corrected. Thus we obtain a system that is free of all aberrations up to the 
second order, which is called a second order achromat. From the symplectic 
condition, we know that, up to the second order, the path length depends on 
ô only. 


Next, we are going to prove that only two independent families of chromatic 
terms are left. Going back to the equations of motion, the second order 
chromatic terms are 


Lf = Tti T 1,a50i0, Qf = Qi T Ta, s5tió A T, 520°, 


yf = yi + Ty bsbið, bg = bi + Toys yió. 


In the normalized space, the map becomes 


Tf v/ / Ds. t+ Tr aað V Baki 
ar Qrt + Bra) / Bs. a+Ta 5x0 + T, 520? (Oats + d.) [V B 


Wl y/ V/ By ° y + Ty,p5bd 1 Py 
bj \(ayy + 8,0) / VB b+ Thysy6 (—aydi +b) /VBy 


z// B VJ Baki + Tx,a6 (azi + ai) / VBa] ô 
jist: Ba) IVB (—arči + d;) IV Be + Toss Dado + T, 520? 
y/ / By Byyi + Ty bô (esi +8) / P] ô 
ayy + Byb) / / B. (7e + bi) / f By + Tous y Bai: 

Ti + Tx ,a6 (arti T aj) / B) ô 
di + |(82Ta z T aTe as) /B«] Tið + (au / Bx) T5,a50;0 + V/ BT, o2 8? 
bi + |(82To, y8 — o2Ty as) / By] Yd + (ay/by) Ty b5bið 


Il 
Oo 
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'The sum over the system is 


Ty 

~ n—1 

df —2mm [n — bx 
PS = R 

UF > Eal 
br 


Tx,a5 |( ox + a)/ B] 6 
[(82Ta, zô d & T; o8) Bs] có + (Qs / B) Ts aað + V Bx Ta, 520? 
Ty,b5 [(—ayy + b)/ B] ô 
[(BTo us — OT y.05) / By] yd + (oy By) Ty ps0 


2mm [n + be 
i Cas T ' 


Since the z and y planes are decoupled, we can separate them. The z plane 
is 


Tf E cos (2mm / n + dbx) — sin (2mm /n + dz) 
a p ( H be) ) 


f \ sin (2mm n 4 cos (2mm /n + oz) 
: = (ax/Bx) Tr,056 (1/85) Tr,a55 
|(B2Ta z = aTe, as) / B] ô (o / Bx) Tx,a50 


d cos (Qma/n+ oz) sin(2mm/n + dz) d, 
— sin (2mm/n + x) cos (2mm/n + Qx) 


[7 
V (cos (2mm n 4- à;)  — sin (2mm/n + ġa) 0 
z b» sin (2mm/n-4- $4) | cos(2mm/n 4- dx) V B.T, 5202 J ' 


and the y plane is 


jr P cos (2mm /n + Qy) — sin (2mm /n + dy) 
d p | + by) 


Km \ sin (2mm /m 4 cos (2mm /n + oy) 


— (@y/By) Ty usó (1/By) Ty, b58 
[(85 Toys = aT y,05) /By] ô (ay/By) Ty, bð 


( cos (2mm/n + oy) sin (2mm/n + 2) B 


— sin (2mm/n + y) cos (2mm /n + dy) b; 


The terms (x|?) and (a|0?) are 


20 


in2 —in2 
E (ws n/n ma. in 7) zs 


= a 2mm ) Y giQmn/nós) _ g-i(2mm n ó;) 


m=0 


1 — ei27/n EE 1— e-i27/n 
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et2mn/n+¢x) 4 e iQmn /nós) 


n—1 n— 
Yes (222.9, = Y 5 


m=0 m=0 
1 "EF in2m /n mq m —in2m/n 

emt eios aes MM e tba pees =0. 
2 1— ci2n /n T— e-i2n/n 


Every other term in the above expression can be written in the form of 


aer , ( 2mm | 9-1 f( 2mm 
5 cos | — + ¢; | sin — +4; 
n n 
= [-—— xi eg (Qm /nó;) | l [-——— - e (Qmm /né;) | 2-1 
0 


2 2i 


where | = 0,1,2,2 and j = {x,y}. Dropping the common parts in each sum, 
which are functions of $;, there are only three kinds of sums 


n—1 i= e in4n/n n—1 


5y eA mm /n Z T » e imm [n = Te 5 le n, 


m=0 m=0 m=0 


where $575. 1. gi4mz/n = 0 and 7") e7#4™7/" = 0 when n # 2. As a result, 
we have 


n—1 n—1 
2mm n i 2MT n 
Dele) C 


m=0 
n-1 
2 2 
Y di (= +4) ds (= +95) ~0. 
mc TL TL 


The remaining terms are 
af 2 —C, 0 a; J’ by ED —Cy 0 bi : 


1 i es 26 — aT ad 1 B?To yò = aT, bó 
Cy = —T,,50 — 2S m6, Cy = >T, p90 — ———— —— 371 , 
a ws Ba ZEE "us By 
which shows that there are only two independent terms. 
It turns out that there is another way to prove this point which is probably 
more elegant. We first observe that, from the equations of motion (3.22), the 
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second order map of the n cell achromatic system in the normalized coordi- 
nates can be written as 


Tf Li T Ty 51,0 T Pasa “Tr Ty 5262 

a Gi + Ta es uÓ + Ta agdið + Ty, 526° 

uf = i j| aði D ,92 l (9.2) 
Yf Yi T Ty ys yid T y bôbið 

by b; + Ty, ys9id + Ti b5bið 


where all geometrical terms vanish. Note that midplane symmetry is obeyed. 
Since the x and y planes are decoupled, let us study the x plane first. Let us 
denote the second order map of the whole system 


M (n) 2 Z-47T (n), 


that of one cell is 


M (1) 2 R (1) - T (1), 
and that of n — 1 cells is 
M (n — 1) 2 R(n—1)- T (n— 1). 


Since the n cells are identical, the whole system can be viewed as either one 
cell in front of n — 1 cells or vice versa. Thus the following relations hold 


M (n) 23 M (n 2 1) eM (1) 22 [^R (n 21) + T (n - 3] e [R (1) + T (1)] 
—3 1-4 R(n-1)oT (1)-- T (n - 1)o R (1), 


and 


M (n) 23 M (1) o M (n — 1) 22 [R (1) T (Jo [R (n — 1) + T (n — 1)] 
-3I-4 R(1oT(n-1)4T (1)o R(n-— 1). 


Removing the first order part, we obtain 
T (n) 2 'R(n—1)oT (1) - T (n —- 1)o R (1), 


and 


T (n) =R(l)oT(n—-1) + T (1)o R(n-— 1). 
Furthermore, we obtain 
T(noR()!-2R(n-1»eT()oR(1) !-T(n-1), 


and 


Rü)'eT(n2T(n-1)-4R()'eT()oeR(n- 1). 
Using the relation R (n — 1) = R (1) ! , we reach the following relation 


T (n) eR. (1) 2 R. (1)o T (n). 
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Plugging in the first two components of eq. (9.2), we arrive at 
COS fla SİN [Ly T, 6202 E T, 59? 
— SİN Ug COS [i T, 520? T, 5? 
and 


T0560 T asô COS Uy Sin Hg COS [lz SiN Hy Tao xô Tag 
Ta,266 Ta asô ) V — SiN fe COS fe = sin Hy cosus JV Ty055 Ta,asd ) 
For the pure chromatic terms, we have 
l — COS uy —SiIN Hg T, 62 u 0 
Sin uy — 1-— COS Ux T. TAS 
Since, for n > 1, we have 


T= costy  —sinu; Y — 
det ( sin Ug [uie AN Cos hale ts 


we conclude that 


For the mixed chromatic terms, we have 


= COS Uz — Tras sin Hz Toys sin Hx T Ex,a6 COS Lax ) 


Ta,x6 COS Uz — Taas sin Hz Tzs sin Hx T La,að COS La 


Tr ,x6 COS La T La,xó sin Ha Taus COS Uz T La,ad sin Ha: 
SLri, sin Hx T iaaó COS Ur —im,aó sin Hx T La,aó COS Ma 


For each component, we have 
7 -22,aó sin Ha = dars sin Has Tegs sin Hz = T'a;aó sin Lis, 


~oas Sin fe = Teas Sine, Iagan [uy = — Tras SN pua. 
For n Æ 2, sin uz zz 0. We have 
Tizi = Toas =0; Toas + Ta zô =0. 
From symplectic symmetry, we have 


1x. ud ios 
det | Lom ee =1 
Ta, xô 1+ T4450 
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up to the first order of 6. As a result, we have 
Tosi T Ta að = 0, 


which, combined with the previous result, leads to 


Tras P 0, T5,a6 — 0. 


In the original space, 


and 


Ters T as = 
Ta,x6 Ta,a6 = 


ái : -— : 
Ta,25 a6 
de wv Jute vc) 
os B " Tas 0 Qg / V Bx VBe 


= E Pe Jt Tra: 
—Yr Agr 


The Twiss parameters here are the periodic solution of the cell. In conclusion, 
there is only one independent chromatic aberration in the x plane. The same 
conclusion can be reached for the y plane following the same procedure. This 
proves, from the global point of view, that only two second order chromatic 
aberrations are independent. 

Furthermore, it is easy to show that the remaining terms T eod and T, bs are 
simply the chromaticities. We can write the transfer matrix in the normalized 
space as 


Res 1 HET E cos (27) sin (27) + T, asô 
ana zi d ~ \ sin (27) — Tp ad cos (27) 
cos (27 + T,069) sin (27 + T, 5) 


— sin (2r + Ta) cos (27 + Traat) ' 


=1 


and we obtain 


bx = i»,aó- 


Similarly, 
me cos (27 + m) sin (27 + 7,659) 
—sin (27 + m) cos (27 + m) 
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and " 
Sy = Ty,bs- 

The simplest of such a second order achromat consists of four FODO cells 
with us, = Hy = 1/2 and two families of sextupoles correcting the chromatic- 
ities. 

9.1.2 The Dispersion Suppressor 


As it become clear shortly, an achromat of n identical cells is not optimal 
in terms of minimizing Dmax. Let us consider half of an achromat, 


ey gr 
= (3 a 


For an n cell achromat, D, D' at the center are 


D -1 0 d 0 d 
D' — 0 —1 d' 0 = d' , 
1 0 0 1 1 1 


whereas the periodic solution is 


D ex 0 dx 
D^ We), 0. saa? do p 
1 0 0 1 1 
" di 
pie pes 
= 2 2 


Obviously the dispersion at the center of the achromat is twice that of the 
periodic solution of a cell. There is a module called dispersion suppressor 
which makes the whole section an achromat while maintaining the periodic 
solution in the regular cells. It takes advantage of the fact that dipoles, espe- 
cially when 0 is small, affect only the dispersion, not focusing. A dispersion 
suppressor consists of two FODO cells which are the same as the standard 
cells except for the bending angle. Two free parameters, or “knobs,” such 
as the the bending angles can fulfill the two conditions needed to obtain an 
achromat. 
Recalling eq. (9.1), the transfer matrix of a FODO cell with bending is 


1-B/2f? — 21(1+1/2f) 210 (1 4- 1/Af) 


Ma =| —1/2f?+l/4f> i-Py2f? 29(-1/4f—-D/8f2) 
0 0 1 
COS u B sing 20 [1 + (1/2) sin (u/2)] 8 


= | — (1/8)sinu cosu 2[1 + (1/2)sin (u/2)] [1 — sin (u/2)]0 
0 0 1 
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For two cells of bending 01 and 62 per half cell, the total matrix is 


COS u Bsinu 2l [1 + (1/2) sin (u/2)] 02 
My =| —(/B)smp cosu 2(1 + (1/2)sin (u/2)] [1 — sin (u/2)] 82 
0 0 1 
COS u Bsingu 2l [1 + (1/2) sin (u/2)] 01 
—(1/8)sin cose  2[1-- (1/2)sin (u/2)] [1 — sin (y2/2)] 6 
0 0 1 
cos (211) Bsin(2u) d 
-l-ü/8smQu) cosQu) a’ |, 
0 0 1 
where 
d —2l h a sim (5) 6; cosu 4- 2 h 4 L sin (5) [1 — sin (£)] 6, B sin u 


421 [t+ 5 sin (5) 00 


=) i + sim (5) [(2cos u + 1) 01 + 82], 


= ess (4)| [1 — sin (£) [(2 cos u — 1) 061 + 62]. 


Note that the relation 
B 2l [1 + sin(u/2)] 
7 sing 
was used during the derivations above which lead to the final forms of d and 
d'. 
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To match a dispersion free region to a FODO cell, we have 


cos(2u) | fsin(2u) d 0 Dax 
—sin(2u)/B  cos(2u) d' 0] = 0 ; 
0 0 1 1 1 
which leads to 
0 


2 cos p + 1) 04 + 05 = ——————, 
od E RS 


(2cos u — 1) 01 + 05 = 0. 


'The result is 0 
Bp = ——, obi Bi: 
* Asin? (u/2)? ^ : 


So, we have 


wlaAwla 


Dispersion suppressors are widely used in high energy accelerators where 
the achromatic straight section constitutes only a small portion of the ring. 
Since the main part of the ring is made up of arcs, the cost-effective way to 
build such a ring is to pack dipole magnets as close as possible. A FODO cell is 
the best choice for this purpose. Another way to save cost is to keep the beam 
pipe as small as possible, which saves not only due to smaller pipes themselves, 
but also, more importantly, smaller magnets. Dispersion suppressors help to 
keep beam size small by keeping the dispersion matched. In fact, there is 
another parameter that plays an important role in the optimization process, 
which is the length of the FODO cell. Both Bmax and Dmax are proportional 
to the length of the cell. A shorter cell leads to smaller beam size, but tends to 
decrease the packing factor, which is the ratio of the length of total bending 
over the total length of the cell. 


9.2 Symmetric Achromats 


Achromats based on mirror symmetry are widely used in beamlines and 
accelerators, especially synchrotron light sources. One difference between a 
synchrotron light source and a high energy accelerator is in the number of 
experiments it supports. While a high energy accelerator usually supports 
around ten fixed target experiments and a handful of collider experiments (less 
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than five), a synchrotron light source usually supports tens and sometimes 
more than a hundred experiments with the circumference of the ring only 
a fraction of the high energy counterpart. Furthermore, insertion devices 
(wigglers and undulators) have become the main source of light, as opposed 
to bend magnets. These requirements result in a ring divided into many 
sections (usually identical ones) with long straight sections in between where 
dispersion is either zero or small. Apparently FODO cells plus dispersion 
suppressors are not well suited for this kind of ring. The solution has been 
mirror symmetric achromatic sections with relatively long straight sections at 
the ends. 

Before going into the details of the lattice modules, let us first look into 
the general properties of a mirror symmetric cell. Mirror symmetry here is 
referred to as the symmetry between the cell and its mirror image of the x-y 
plane. In other words, a mirror symmetric cell means that the optical elements 
of the cell are symmetric about the center, both in terms of geometry and the 
excitation of the fields. For example, a quadrupole is mirror symmetric and 
a sector bend is mirror symmetric, too. When a cell is mirror symmetric, the 
map of the cell is the same as that of its mirror image. To obtain the map of 
the mirror image cell, we observe that a particle that enters the mirror image 
cell with (zy, —a,, yy, —bp) exits it with (xj, —ai, yi, —5;), where (zi, ai, Yi, bi) 
and (£f,af,yf,bf) are the entrance and exit coordinates of the transverse 
phase space of the original cell. Hence the map of the mirror image cell is 


M! =Ro MoR}, 


where 


Dq O ODGO 
.oo0oo0ooooODO 
Occ & 8 


] 1 00 alx) (zla) (x]ó)V 0 0 
Mi-|o0-10 (alx) (ala) (ald) -1 0 
0 1 0 0 0 1 


Mirror symmetry entails that M, = MI , which leads to 


(z|a) = (ala) 
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and 
1+ (2|x) 


(v|a) 
Note that the two equations from the dispersion and the dispersion prime are 
not linearly independent. It is easy to verify that the transverse linear matrix 
of a magnetic sector dipole satisfies the above relations. For a linearly stable 
cell, i.e., |(z|x)| < 1, mirror symmetry implies that 

(vv) — (ala) 


= — Z> 0, 
s 2 sin [ey 


(aļð) = (26). 


and 


; _ (1= (2|2)) (ald)  (alz)(x|9) _ 1— (lx)? + (xla)(alz) 2 
D' = ee (26) = 0. 
2 — (a|x) — (ala) [2 — (z|z) — (ala)] (x|a) 
Alternatively, we can also express the transfer matrix of a mirror symmetric 
cell as functions of the first half of the cell. If the matrix of the first half of a 
mirror symmetry cell is 


'The matrix of the whole cell is 
MT = xr xr, 

(ala) (la) —(ala)s(]3) + (ala)a(ald): V / (ele): Gl). (ald): 
= | (alx)1 (xx) —(alx)1(2]d)1 + (ælz)ı(aļô)ı || (@lz)1 (ala) (al0): 


0 0 1 0 0 1 
(z|x)ı (aja) + (z]@)1 (ala) 2(x\a)1 (aja)1 2(x\a)1(a|5)1 
= 2(v|z)1(a|v)i (z|x)ı (ala)i + (zļa)ı (ala). 26e] )i (ald) 
0 1 


In addition to the relations obtained above, this alternative expression shows 
that when (a|ó); = 0, the cell is achromatic. Furthermore it reveals another 
relation, which is 


— 


z|z) 


(a9) = (x|6). 


— 


z|a)1 
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To make the meaning of the relation clearer, let us add the drift of length L 
after the cell. We obtain 


t 3 a uae 7 ricus E 
0 1) V (abri (ele) (ala) Gl — J^ 


and 

1 L|) [2(a(al) | — ( 2[G]a) + L(z|z)1] (a]8)1 

0 1 2(x|z)1(a|9)1 2(x|z)1(a|ó)1 l 
When L = —(x|a)1/(x|x)1, the second half of the cell forms an image and the 
dispersive ray crosses the axis. In other words, the dispersive ray behaves the 


same as the axial ray from the center of the cell. The reader can check easily 
that it is indeed the case for a sector bend. 


9.2.1 The Double-Bend Achromat 


Now let us study the simplest mirror symmetric achromat, which consists 
of two bend magnets and a quadrupole in the middle. Due to the mirror 
symmetry, the achromatic conditions D — D' — 0 at the end can be satisfied 
requiring D' — 0 at the center. 


De 1 00 1 L40 1 L L0/2 0 

o |=| AE 1 0 010 01 6 0], 

1 0 01 001 00 1 1 

De 1 Ls (L/2+ L1)0 0 
0 |=| yay t= (4) Ff [L-—0/200J23 Lı)l8 0 
1 0 0 1 


L 1/L 
DiS On. eee au 
=> (5+ JL f Ten :) 


From the discussion above, the relation f = (L/2 + L1)/2 is simply the result 
of the mirror symmetry. Even with a large bending angle where the exact 
matrix of the bend has to be used, being achromatic always implies that the 
center of the first bend is imaged to the center of the second bend. Since 
f < (L+ L1)/2, it is not possible to build a FODO cell that is stable, so 
a doublet or a triplet has to be used. A simple variant of the double-bend 
achromat (DBA) is the triplet DBA shown in Fig. 9.5, which contains a triplet 
between the bending magnets, with no quadrupoles outside. 

Another type of DBA consists of two bending magnets, a focusing quadrupole 
in between and a doublet outside of each bend as shown in Fig. 9.6. The cell 
is symmetric about the center of QF1. Fig. 9.7 [66, 21] shows an example of 
the lattice function of one example of this type of achromat. 
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FIGURE 9.5: The simplest double-bend achromat (DBA). 
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FIGURE 9.6: The double-bend achromat (DBA). 


9.2.2 The Triple-Bend Achromat 


The fact that the center quadrupole of the DBA images the center of the 
first bend to that of the second makes the DBA lattice somewhat inflexible 
since the horizontal phase advance between the centers of the bends is always 
around m. To overcome this shortcoming, triple-bend achromat (TBA) 
lattices were developed. A TBA consists of three bending magnets, at least 
two quadrupoles between them, and doublets (or triplets) outside, as shown 
in Fig. 9.8. Lattice functions for a typical example of such kind of achromat 
are shown in Fig. 9.9 [65, 21]. 


9.2.3 The Multiple-Bend Achromat 


In the past two decades, the concept of multiple-bend achromat (MBA) has 
been conceived of and developed to further reduce dispersion in the bending 
magnets. As discussed in Section 9.2.4, this will help reduce the emittance 
of the electron beam and increase the brightness of the X-ray produced from 
synchrotron radiation. Fig. 9.10 shows the first MBA lattice, developed at 
the MAX IV Laboratory at Lund University, Lund, Sweden. The middle 
units are very similar to regular FODO cells and the end units are used as 
dispersion suppressors. Recent variants increase the distance between the 
outer most bending magnets and the middle ones to generate a dispersion 
bump there. As a result, the strengths of the sextupoles are reduced and the 
dynamic aperture is enlarged. The complexity of the lattice makes a multi- 
dimensional optimization tool a necessity. In that regard, the spread of various 
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FIGURE 9.7: Lattice functions of a double-bend achromat (DBA), which 
is one of the four super-periods of the storage at Center for Advanced Mi- 
crostructures and Devices (CAMD) at Louisiana State University. 
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FIGURE 9.8: The triple-bend achromat (TBA). 


numerical algorithms in the community greatly expedited the development of 
this concept. 


9.2.4 The H Function 


The design of such triple-bend achromats is mainly driven by the demand 
for small equilibrium emittance for synchrotron light sources. Although syn- 
chrotron radiation is not covered in this book, it is important to introduce the 
concept of the equilibrium emittance which results from synchrotron radiation, 
since it is crucial for understanding the motivation behind the development 
of lattice modules for the synchrotron light sources. The main difference be- 
tween electron and hadron (proton, antiproton and ion) rings is synchrotron 
radiation. Since for a given bending radius the total radiated power is pro- 
portional to 4^ (Y = E/ mc?) , Synchrotron radiation becomes significant at 
a much lower energy for electrons. The energy loss is compensated with RF 
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FIGURE 9.9: Lattice functions of a triple-bend achromat (TBA) of the 
Advanced Light Source (ALS) at Lawrence Berkeley National Laboratory, 
California, USA. 


cavities. 

Since the photons are emitted into a forward pointing cone of the opening 
angle 1/y which is small for electrons of GeV level energy, all three components 
of the momentum decrease at roughly the same rate. The RF cavity, on the 
other hand, increases only the longitudinal momentum. As a result, transverse 
momentum is damped over time. 

Yet the presence of dispersion in a ring causes the emittance to grow due 
to synchrotron radiation. Let us consider an off-momentum electron moving 
along the closed orbit for the momentum in a dispersive region. After a photon 
is emitted, the position and slope of the electron remain unchanged but the 
total energy decreases. Suddenly the orbit the electron moves along is no 
longer the closed orbit for it and the electron starts to oscillate around the 
new closed orbit, resulting in emittance growth. The equilibrium emittance is 
reached when the damping rate equals the growth rate. It turns out that, for a 
ring with an identical bending field, the equilibrium emittance is proportional 
to (H) where 


mag? 


H = YD? + 20, DD' + B,D", 


and 
1 


V mag ~ 2np UM m 


Note that D and D' are periodic solutions of the position and slope of dis- 
persion. Furthermore, is a constant outside of dipole magnets and changes 
inside dipole magnets. To demonstrate this point, let us consider two points 
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FIGURE 9.10: The lattice functions zv, By, and dispersion multiplied by 
10 in the MAX IV multiple-bend achromat (MBA) lattice at Lund University, 
Lund, Sweden. (© 1996 IEEE. Reprinted, with permission, from D. Einfeld, 
et. al., in Proc. PAC 1995, 1, 177, 1996 [26].) 


in the ring. The linear map between them is 


Maa = | (alz)i2 (ala)i2 (ald)i2 |. 


D» (zje) (xzja)ı2 (x|ô)i2 Dı 
D, | = | (aļz)ı2 (aļa)ı2 (aļô)ı2 Di 
1 0 0 1 1 


Hence, we have 


H2 =72D3 + 203D3D3 + 83D? 


=a 55 (7) (51) 
Jo ene] (22) 


( 
| ( @lz)2 Grla)i2 V [ Dr (z|0)12 
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It is clear that H2 = Hı if (x|); = (alô); = 0, which is the case for any 
two points that are in the same straight section. With the expression above, 
we can obtain the derivative of H with respect to s which can illustrate the 
matter even clearer. When the two points are close to each other, the linear 
map becomes 


i 1 ds 0 
dM — | —kds 1 ds/p 
0 0 1 


Carrying out the derivation one step further, we obtain 


ei- uoo (2. a (2 2) (5,) 


ox) Gg») 
(5) 8) Cs YB) QD 8) Cane) 


ds 


=; 2 (a1 Di + £4D1) i 


In summary, we have 


2 
H’ = p (a, Dı + bDi) 4 


In order to achieve small emittance, H has to be small, which leads to strong 
quadrupoles. This in turn leads to strong sextupoles to correct chromaticities 
which in general would result in strong nonlinear motion and small dynamic 
aperture. TBA lattices can provide smaller dispersion and hence smaller 
emittance than DBA lattices, which result in stronger sextupoles and smaller 
dynamic aperture. This is one of the reasons that TBA lattices fell out of 
favor in the most recent synchrotron light sources. Another reason is that, 
when the achromatic condition is not strictly enforced, DBA lattices appear 
to be more flexible than TBA lattices, especially when more quadrupoles are 
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used. For example, the DBA lattice of the Shanghai Synchrotron Radiation 
Facility (SSRF), Shanghai, China, contains two quadrupole doublets between 
the bending magnets and two triplets outside. In fact, almost all synchrotron 
light sources built in the past decade adopted DBA lattices. 


9.3 Special Purpose Modules 
9.3.1 The Low Beta Insertion 


In both circular and linear colliders, the beam is focused as tightly as pos- 
sible at the collision points to maximize the density of particles and hence 
collision rate. The simplest low beta insertion consists of two quadrupole 
doublets placed and excited symmetrically about the interaction point. The 
upstream doublet is roughly a parallel-to-point system, in which an initially 
nearly parallel beam is brought down to a small point and the downstream 
doublet is a point—-to-parallel system. In hadron colliders where the emittance 
is relatively big, triplets are used to increase focusing power and reduce the 
width of the beam in the quadrupoles. In addition, it provides more flexibility 
for tuning. 


Recall from eq. (6.13) that 


s2 


B(s) = BY + .- 


Since the distance between the interaction point and the last quadrupole is 
on the order of 10 m and 8* is below 1 m, the 8 functions in the quadrupoles 
range from hundreds m to over 1 km (see Fig. 9.11). Combined with high 
gradient in the quadrupoles, the low beta insertion generates large chromatic 
and geometric aberrations. In hadron colliders, due to the relatively large 
emittance, the main effect of the aberrations is the additional chromaticities, 
which are corrected by the sextupoles in the arcs. In addition, the large beam 
size at the quadrupoles implies extra tight tolerances on multipole errors in 
those quadrupoles, which, if too large, would excite undesirable resonances 
causing emittance and/or beam loss. In electron-positron linear colliders, 
the emittance is small and the aberrations generated by the low beta inser- 
tion would cause sizable increase in beam size at the interaction point. As a 
result, the matching section between the low beta section and the linear accel- 
erator (linac) is rather complicated. Telescopes are used to minimize certain 
aberrations and sextupoles are used to correct chromatic aberrations. 
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FIGURE 9.11: Lattice functions of a typical low beta insertion with sym- 
metric quadrupole triplets. Here 6* is 0.5 m. 


9.3.2 The Chicane Bunch Compressor 


A simple yet very effective and commonly used module in linac based free 
electron lasers (FELs) is the so-called chicane bunch compressor. It consists 
of four identical rectangular homogeneous bending magnets separated by drift 
spaces, with the middle two magnets bending in the opposite direction, and 
the reference orbit perpendicular to the entrance of the first and third magnets 
and the exit of the second and fourth magnets (see Fig. 9.12). The whole 
module is mirror symmetric about the center. Such an arrangement ensures 
that the bunch compressor is achromatic to all orders and that electrons with 
higher energy go through shorter paths. When a bunch of electrons enters the 
compressor with a correlation between the longitudinal position and energy, 
the bunch length changes at the exit of the compressor. If the slope is negative, 
i.e., the electrons in the head of the bunch have lower energy, the bunch is 
compressed. 

Next, let us take a look at the basic optical properties of the chicane bunch 
compressor. The horizontal transfer matrix of the first bend is 


il 0 0 cos @ Rosing Ro (1 — cos ¢) 
M} = | 1/Rotanġ 1 0 —1/Rosinó  cosó sin ó 
0 0 1 0 0 1 


cosó Rosing Ro (1 — cos ¢) 
0 1/cosó tan $ 
0 0 1 


To obtain the transfer matrix of a bend magnet with the opposite direction 
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FIGURE 9.12: Layout of the chicane bunch compressor. The top/bottom 
trajectories are those of particles of lower/higher momenta than the reference 
trajectory in the middle. 


of bending, we have to first find out the transformation between internal and 
external coordinate systems. Taking into account the fact that positive x in 
the internal system (away from the center of the arc of the design orbit) is 
negative in the external system, the transformation is 


The transformation in the vertical plane is the identity matrix. Therefore the 
horizontal transfer matrix of the second bend is 


cos $ Rosing Ro (1 —cos¢) 1 0 0 
M2-$,| -1/Rosinó coso sind 1/Rotang 1 0 | $z! 

0 0 1 0 0 1 
-1 0 0 1/cos¢@ Rosing Ro (1 — cos ¢) —1 0 0 
= 0-1 0 0 cos $ sing 0-1 0 
0 01 0 0 1 0 01 

1/cos¢@ Rosing —Ro (1 — cos ¢) 

= 0 cos @ — sinó 
0 0 1 


The horizontal matrix of the first and the second bends separated by a drift 
Li is 


1/cos $ Rosin ó — Ro(1—cos ¢)\/1 L4 0\/cos¢ Rosin ¢ Ro(1—cos ¢) 


Mh- 0 cos b —sing 010 0 Icosd tano 
0 0 1 001 0 0 1 
1 2Rosinġ + Li/cos?$ [2Ro(1— cosQ) + Li tang] / cos ó 
=! 0 1 0 


0 0 1 
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From the mirror symmetry of the chicane, we can conclude that the module 
is achromatic. For ¢ < 1, the dispersion between the second and the third 
bends is D = Li ¢ + Ro¢?. It is worth noting that the focusing in the vertical 
plane is insignificant. As a result the bunch compressor is transparent in 
transverse dynamics. 


Finally, let us work out the path length difference between the reference 
electron and one that has a different momentum p = (1+ 6) po. Due to the 
symmetry, the difference can be obtained analytically, which is 


l— lo = 4 (Rẹ — Rogo) + 2L: (29 2 1) 


sọ 

where 

R= Ro (14-6), 
and 

R . 
sind = sin go = m 
Plugging in R and ¢, we have 
i 1 

l—lo = ARo fja + ô) arcsin (3) = | +214 __(1+4)cos¢o __ —1 


(1 +8)? — sin? do 


In order to have a better idea about the relation between l — Ip and 6, we 
would like to learn how the low order terms behave. Before we proceed with 
the Taylor expansion of l — lo, let us first work out that of arcsin(zo + Az). 


Using arcsin(z4/1 — y?4-yv1 — z?) = arcsin(x)--arcsin(y) and setting x = xo, 
we obtain 


roy 1 — y? + yA/1— 22 = ro + Ar. 


After straightforward algebra, we obtain 


y = (zo + Az) 4/1 — 22 — av (zo + Ar)’. 
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As a result, 


arcsin (zo + Az) 


— arcsin (zo) earesin Go TAzrx)jl-zà-—zoWl- (ao + Aa 


2xoA Ax? 
= arcsin (zo) -- arcsin le + Ar)jl 5 zo94/1 BEER 
V V — a2 


2zoAz--Azx? 4a2Ax? 
=p arcsin (xo) enin fani t T2-- XQ 1 bud Xo Gm x Te M | 
x9) 


3 (1-2) 
2x9A Az? oA qe 

=p arcsin (xo) +Azy1 -23+ y1 — TOAT MUR BE 
ECT (1— xå) 


Ax xo Ax? 
= ENT 
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To the second order, the path length difference is 


=» arcsin (zo) + 


| — lo =2 4Ro [(1 + ô) arcsin ((1 — 6 + 8°) sin ¢o) — 0] 
(1 + ô) cos ġo m 

V/ cos? dg + 26 + 6? 
= 4Ro (a + ô) arcsin (sin oo — (ó -— 6?) sin $o) — do] 
€— i — 

1 + (26 + 62) / cos? ġo 
= 4Ro fa + ô) E — (6 om 6°) tan do + D» tan? J = o) 
1204-0? 3 46 ) -1| 


+2Lı 


+2Lı 


+224 [re (1 2 


2 cos? do | 8 cos! do 
1 
2 
— 214 tan? do (ə E JE 


2 cos? do 


For ġo « 1, only the terms of the lowest order in $9 are important and we 
obtain 


| — lo => —2L1 0? (; = P) ; 


As an example, we discuss the parameters of the first bunch compressor 
at the Linac Coherent Light Source (LCLS) at SLAC National Accelerator 
Laboratory, California, USA, which operates at a beam energy of 250 MeV. 
It is worth noting that the bend radius Ro of 2.48 m (corresponding to a 
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FIGURE 9.13: Mechanism of an RF buncher cavity. 


magnetic field Bo of 3.36 kG) and L4 at 2.61 m are roughly the same and 
that the bend angle of $9 = 4.62°, which corresponds to a magnet length 
Ly = 0.2 m is very small. For the term (1|ó), the contribution from the 
magnets is —1.74 mm and that from the drifts is —34.07 mm. For the term 
(1]8?), the contribution from the magnets is 2.62 mm and that from the drifts 
is 51.44 mm. At this energy, the electrons are relativistic enough that the 
contribution from the difference in velocity is miniscule (—27 um for (1|ó) and 
40 um for (1|02)). 


9.3.3 Other Bunch Compressors 


As shown above, chicane bunch compressors work when the higher momen- 
tum particles are in the tail of a bunch. Yet higher momentum particles are 
often in the head of the bunch. An important example is the DC gun, where 
higher momentum particles are faster and thus arrive earlier. To compress 
such bunches, one method is to configure electrostatic or magnetic fields in 
such a way that the faster particles go through longer paths. 

Another method is to reverse the correlation between energy and longitu- 
dinal position before the bunch enters a chicane bunch compressor. This is 
achieved most commonly through an RF cavity, which because of its func- 
tionality is often called a buncher. It is a regular RF structure which is set 
up through adjusting the phase such that the mean energy of the bunch is 
unchanged. Meanwhile, the head of the bunch, where the high energy par- 
ticles are, is decelerated, and the opposite happens to the tail of the bunch. 
Fig. 9.13 illustrates the mechanism of a buncher. When the particles are not 
highly relativistic, an RF buncher and the drift space downstream can achieve 
bunch compression. This is called ballistic bunching. 


Chapter 10 


Synchrotron Motion 


Up to now we have been primarily concerned with the motion in the trans- 
verse planes. Yet, for particle accelerators, as the name implies, acceleration 
is the primary interest. T'herefore the motion in the longitudinal phase space 
has to be understood. Although there are many different ways of accelerating 
charged particles, we restrict ourselves mostly to circular accelerators (syn- 
chrotrons, to be specific), where the acceleration is done using radio frequency 
(RF) cavities. The only exception is the last section, where the transverse dy- 
namics of RF cavities is discussed, which is of great significance mainly for 
linacs. 


'The chapter is organized as follows. First, a section is devoted to a brief 
description of a typical RF cavity used in a ring. Second, the time-of-flight 
as the function of energy is derived. Next, combining the results from the 
previous sections, the map of the longitudinal phase space is obtained and 
the longitudinal motion is studied in detail. Last, the transverse effect of the 
RF cavities is discussed briefly. 


10.1 RF Fundamentals 


Most RF cavities used in synchrotrons are variations of the cylindrical pill- 
box cavity, which consists of two circular metallic plates of radius Re that are 
separated by the distance l and that are connected with a cylindrical mantel 
of radius Re, resulting in a geometry reminiscent of a circular pill box. 


The field distribution in the interior of such a kind of a metal box can be 
written in a simple analytical form. Following Wangler [70], the electromag- 
netic field of such a cavity of length | and radius Re with transverse magnetic 
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FIGURE 10.1: Typical RF cavity field with fundamental mode TM jo. 
Radial dependence of the normalized electric field E;(r,0)/ Eo (left) and the 
normalized magnetic field Bo(r, —1/4f)c/ E (right) are shown as a function 
of xoir/Re. 


field (mode of TMmnp) can be written as 


E, = EoJm (kmnr) cos (m0) cos T cos (wt), 


E, = FALCEM, (kmnr) cos (m0) sin T cos (wt) , 
[m T = EoJin (kmnr) sin (m6) sin P cos (ut) , 

B, =0, ü 

B, = expo Jm (kmnr) sin (m0) cos = sin (wt) , 
Bo = w — Eo Jl, (kmnr) cos (m0) cos = sin (wt) , 


where kmn = Umn/Re and w = ey k2,, + (pr/l)?. Note that the quantity £mn 
is the nth zero of the Bessel function Jm(x) (excluding the origin, n > 0). 

Usually the fundamental mode of TMo10 is used for accelerating charged 
particles, whose field is 


E, = Eoo (==) cos (wt), E,=0, Eg-0, 


E 
Bod. Heg. B=- (=) sin (wt) , 
c Re 
where the relation Jj (x) = —Jı (x) is used to obtain Bg (see Fig. 10.1.) 


Note that Bg is proportional to EZ with 90° phase lag, which is the result of 
Faraday’s law and that x9; = 2.405 which, together with the design frequency, 
determines the size of the cavity. Specifically, for the mode of TMo 0, we have 
101€ - Lore 

qe Dac fe 


Re = 
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For f = 500 MHz, R. = 0.2295 m. In realistic cavity designs, the actual 
shape of the cavity is often more spherical than cylindrical. Yet the overall 
dimension is not very far from this crude estimate. 

For r < Re, which is usually where the beam is, the field can be approxi- 
mated by the lowest order term of the Taylor expansion, which is 


Eo Toir 
E; =, ÈE t ; Bo =; —— 
1 0 COS (w ) 0 1 E 2R. 


sin (wt) . (10.1) 


Note that Jo (x) = 1+0 (a?) and Jı (x) = z/24-O (x°) for x « 1. As a result, 
the effect of Bg is much weaker than that of E, on the beam. Furthermore, 
the focusing effect of the magnetic field is usually negligible compared to main 
focusing elements, the quadrupole magnets in the ring. 

To illustrate this, let us look at an example. Let us consider again the case 
of a 500 MHz cavity. Assuming that Eg = 20 MV/m, which is not far from 
the breakdown limit of copper at this frequency, we obtain the peak gradient 
of the magnetic field, which is 0.35 T/m. Normal conducting quadrupoles, on 
the other hand, can have field gradient up to 20 T/m. Furthermore, there are 
usually tens to hundreds of quadrupoles with lengths between 0.2 and 1 m in 
a ring, whereas there are at most a handful of cavities with lengths usually 
below 0.5 m (around 0.3 m for a 500 MHz pillbox cavity). As a result, the 
integrated gradient of the cavities is on the order of up to perhaps 1/1000 
that of the quadrupoles. 

Recently, the dipole mode (TM110) has also been used to kick the beam 
transversely. The field of TM110 is 


X117 


E; = Eg ( R 


) cos 0 cos (wt), E,=0, Eg-0, 


c 


E c : ; 
B, — 0, B, = zt J. (Z) sin osin on), 


C Tiir : c 
E Re : 
Bg = x |^ (=) = um (32) cos O sin (wt) , 


where the relation Jj(%) = Jo(x) — Ji(x)/x is used to obtain Be. Note 
that 21, = 3.832. For the same cavity, the frequency of the TM;19 mode 
is 211/201 £ 1.6 times that of the TMoio mode. In order to get a clearer 
physical picture of the effect of the field on the beam, let us again perform 
Taylor expansion around the origin and keep only the leading term. The field 
is 


X117 
E, = E 
Pi OR 


cos cos (wt), 
C 


E E 
B, =, = sinsin (wt), Bg — = cosÓsin (wt). 
2c 2c 
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Opposite to the TMoi9 mode, the magnetic field has a much stronger effect 
on the beam. Furthermore, we have 


E 
DB, =1 B, sinf + Bgcos0 =; 2m sin (wt), 
c 


which is an alternating current (AC) dipole and is best suited for kicking 
the beam transversely. Again, using the example of a 500 MHz cavity and 
assuming that Ey = 20 MV/m, we obtain the peak field, which is 0.033 T. 
For such a cavity that is 0.3 m long and the energy of the electron beam being 
1.9 GeV, the peak kick angle is 


ev;ByoAt — eEgl _ 20 x 10° x 0.3 


0, — = ao 
Dz 2p;c 2 x 1.9 x 10? 


= 1.6 x 1078. 


The approximation that equates p,c to the total energy of the electron is based 
on the fact that the relativistic fact y is around 3800 and that the divergence 
of the beam is usually a fraction of 1 mrad. 

Now let us come back to the TMo1o mode and find out the energy gain 
(AK) per pass. To simplify the matter, let us consider a particle that moves 
along the optical axis and the energy gain per pass is much smaller than its 
total kinetic energy (K), which entails that the change of velocity in the cavity 
is negligible. As a result, the energy gain is 


[i 


AK =q | Ez (0,2,t(2))dz, 


L 

2 

where t(z) = to + z/vo. Here we set the origin of the z-axis at the center of 
the cavity. For TMo19 mode, we have 

i 

2 


i L 
AK =q : Eo cos [wt (z)] dz = q Eo cos l^ (1 + =)| dz 
0 


L L 
2 2 


$ $ 2 
= aE | cos c + x) dz — Eo | cos (o + =) dz 
E vo e Bor 


= qEol cos (ġo) Haa 


where A is the wavelength of the electromagnetic field and 89 = vo/c. The sin 
function in the equation above is the result of the finite length of the cavity, 
which is called the transit time factor (T). For the TMoio mode of a pillbox 
cavity, the transit time factor is 


_ sin (zl/ oA) 
e Tl / Bor i 


and the energy per pass is 


AK = qEolT cos (¢o). (10.2) 
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It is clear that the transit time factor is the ratio of the energy gain of a RF 
cavity to that of a DC (direct current) gap of the same field. The relation 
between the transit time factor and the length of the cavity is shown in Fig. 
10.2. It is obvious that T — 1 as l — 0 which means that, for constant voltage 
between the gap, the shorter the gap, the closer the energy gain to that of 
the DC gap. Yet the electric field breakdown limit of the material determines 
the maximum field that can be achieved, thus the energy gain is proportional 
to IT, which is in turn proportional to sin(zl/9A). Therefore, the maximum 
energy gain for the case of constant field is obtained when l = 89A/2. For 
electron storage rings such as those of the synchrotron light sources, fo z 1. 
So we have | = A/2, which corresponds to the fact that the time an electron 
takes to pass through the cavity equals half of the period of the oscillation. 
The transit time factor is T = 2/7 = 0.637. For a 500 MHz cavity, we have 
| — 0.3 m. 

In addition, other issues such as RF power efficiency also have to be taken 
into account. The most used parameter measuring the efficiency is called the 
shunt impedance, which is defined as 


where 
AV = EolT cos(Qo) , 


which is the voltage across the accelerating gap and P4 is the power dissipated 
in the wall. As a result, realistic normal conducting cavities are more or less 
spherical in shape, minimizing the total surface area, with nose cones around 
the beam axis to reduce the length (acceleration gap) of the cavity, maximizing 
the voltage across the gap. For superconducting cavities, the dissipated power 
is much smaller and hence the main goal of design optimization shifts to 
minimizing the peak field on the surface for a given on-axis field to reduce the 
risk of costly quench. The resulting shape is basically the bell shaped cavity, 
which is preferable for other practical reasons as well. 


10.2 The Phase Slip Factor 


Toward the end of the previous section, we studied the energy gain per pass 
of one particle. In this section, we will study the energy gain of a particle 
over many passes in a circular accelerator. Since the cavity is always designed 
for a given accelerator, there is at least one particle (the reference particle) 
that comes back to the cavity at the same phase (synchronous phase ¢,) 
every turn. For an arbitrary particle, it may not come back to the cavity 
at the same phase since it may have different energy. Although an arbitrary 
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T4 


FIGURE 10.2: Dependence of transit time factor on length of the cavity. 


particle may not move along the reference orbit, which is a closed curve, 
the effect of the change of arrival time due to the transverse motion tends 
to average out due to betatron oscillation. This will become clearer below 
when the large difference between the frequencies of the transverse oscillations 
and that of the longitudinal one is shown. As a result, we only consider a 
particle that moves along the close orbit of its energy. Let us consider a ring 
with midplane symmetry that uses pure magnetic elements for bending and 
transverse focusing. Hence eq. (3.17) can be simplified as 


1+ 
= |a + has) TIL 29 


— lc + has) — i 


i | + has) (1 F E. 


T N0 Ps (6) 


az 
vo 
24 
n(2 +7) sa ea = 
no (2 + no) vo 


where x5 and as are the horizontal position and momentum of the closed 
orbit of an off-momentum particle. The difference in arrival time between the 


reference particle is 
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The phase slip factor is defined as 


ph = OA 
tod 
1 F 1 
no T To no 
= 1— (1-4 hz (i+ 2 mu" 
votod | ? no 2 + no 2+ no a 
= qp ghé... (10.3) 


Note that the variable ô is defined as AK / Ko. In a pure magnetic system, mo- 
mentum is the more natural variable since it scales linearly with the magnetic 
field. To this end, we recall eq. (3.16) and have 


A^ 
fog SD n= (14-0), 
Do Do 


Using the relations 


we obtain 


(1+8)? = (1+ ô) (1+ e). 


After a little bit of algebraic manipulations, the exact functional relation 
between ó and Ap/po is obtained, which is 


1 2 
SO Duo 14 WAT MO) oy +82) |. 
no (1 4- n)? 
As a result, the phase slip factor can be written as 

jh = _ At 

todp 

1 j (2 
— 1— (14 has), /1+ TEL m (26, + 62) V (1+ dp) ae ds 
votoðp J 

= Ph + Gy + (10.4) 


Taylor expanding eq. (10.4) to the leading order and taking into account the 
fact that 


we obtain 


sk 2e 1 1 f€D(s) 
m = (0 4 m)? T m — E ds. 
= +) y CJo p(s) 
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The first term is phase slip due to the difference in velocity and the second 
term is that from the difference in path length, which is called the first or- 
der momentum compaction factor a1. Since the velocity difference is inverse 
proportional to the square of yo, it becomes smaller as the energy increases. 
At the point that the two terms are equal, the phase slip factor changes sign, 
which is called the transition, which is 54, = 1/ ay. The significance of the 
transition is that the synchronous phase changes from a stable fixed point to 
an unstable one or vice versa. Specifically, for energy below transition, we 
have np” > 0, which entails that particles with higher energy arrive earlier. 
Therefore the synchronous phase is a stable fixed point when $, lies between 
—r/2 and 0 (Fig. 10.3). For energy above transition, we have 7?" < 0, 
which entails that particles with higher energy arrive later. Therefore the 
synchronous phase is a stable fixed point when $, lies between 0 and 7/2 
(Fig. 10.4). In practice, the phase of the RF cavity has to be changed quickly 
from $, to —@, in order to keep the beam confined, which is called the transi- 
tion jump. It is not unusual that during acceleration, most beam loss occurs 
around transition jump. Figs. 10.3 and 10.4 also show that the maximum 
energy width of particles confined in the longitudinal phase space increases 
when the peak voltage of the RF cavity increases and/or In?^| decreases. 

'The second order effect becomes important when the first order term is 
small enough. From eq. (10.3), we can easily obtain the second order phase 
slip factor. The only complication is that the second order term of 6p has to 
be included. To the second order the relation becomes 


Plugging the equations 
v5 = Dôp + D255, a5 = D'à, 


into eq. (10.4) and expanding it to the second order, we have 


C 
i =-s5(8-4)-af (soe - 299 - 229)a. 
2% Yo C Jo 2 p(s) 7 p(s) 

The integral form of the phase slippage factor shows clearly which quantity 
contributes. For example, only dispersion in the bending magnet contributes 
to n?" and the slope of the dispersion everywhere contributes to r5". Knowl- 
edge of this kind helps greatly during the design of a ring. The computation 
of the phase slip factor, on the other hand, can be done through applying the 
periodic solution of the dispersion function (including nonlinear terms) to the 
fifth variable of the one turn map, which is 


Ph = —— = ——M, (x5, 45, Ôp) . (10.5) 
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V AE, 


FIGURE 10.3: Sketch of phase stability for energy below transition, show- 
ing stable and unstable motion near the fixed points ¢ = $, and ¢ = —6., 
respectively. 


Again, the fifth and the sixth variables here are defined as —vo (t — to) and 
dp. The one turn map M can be easily obtained using Differential Algebraic 
(DA) technique and the periodic solution of the dispersion function up to 
arbitrary orders can be obtained using the procedure of finding the parameter 
dependent fixed point (see Section 8.2.1). As an example, we study in detail 
the first order phase slip factor. The one turn linear matrix of the horizontal 
and the longitudinal phases spaces can be written as 


CI 
M=T le) (ja) 1 (15) 
0 0 0 1 


m= 5 (11) D + (1a) D" + (18)] . (10.6) 
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V AE, 


FIGURE 10.4: Sketch of phase stability for energy above transition, show- 
ing stable and unstable motion near the fixed points ¢ = $, and ¢ = —6., 
respectively. 


Using the (/|x) and the (I|a) equations of (5.10) and eq. (8.3), we find that 
(Ja) = (ala) (xô) — (vl) (alô) 
= (ala) [1 — (a|x)] D — (x]a) D '] — Gel) [—(a|2)D + [1 — (ala) D] 
= (a|z) D + [1 — (z|z)] D’, (10.7) 


Il 
& 


(tla) = (a|a)(|) — (x|a)(al2) 
a) [It — (z|z)] D — (ala aid — (zļa) [- (a|x) D + [1 — (a|a)] D'] 
= — [1 - (a|a)] D — (z|a)D (10.8) 


As result, we have 


nh = 5 ((alz) D? + ((ala) — (12)] DD! — (2la)D"? + (t) 


_ sind 2 ! 12 (IJ) _ - 
Sa (4D? + 2a,DD' + 8;D'?) + = a -5 [Hsing (18)] . 


Before finishing the section, let us find out the effect of a sextupole on 
the second order phase flip factor. Let us consider a sextupole with strength 
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ks = b2/ Bp. For a thin slice of it, the effect is Aa = —k,dsx?. From eq. (11.1), 
we obtain the change of the periodic solution of the second order dispersion 
function at the location of the sextupole slice, which is 


a k ds D? ( B cos (xv) k 


ADj] 2sin(xv) \sin (nv) — a cos (nv) 


where v = 6/27. Plugging it into eq. (10.6) and using eqs. (10.7) and (10.8), 
we obtain 


Ang (s) = 5 [(Ulz)ADz + (a) ADI] 


1 k,dsD? 
~~ C2sin (rv) ui 
— [1 — (a|a)) D + (x|a) D "| (sin (xv) — a cos (mv))}. 


a|z)D + (1— (2|x)) D'] 8 cos (nv) 
After straightforward algebraic and trigonometric manipulations, we arrive at 
a simple result, which is 
ph 1 3 
Ans (s) = GksD ds, 


and the total change of the second order phase slip factor is 


1 C 
Ang" = = | ky (8) D*ds. 
0 


Using the same method, we can easily obtain the result where both horizontal 
and vertical dispersion are present but coupling is corrected. Here the kick of 


the sextupole slice is 
Aa r? — y? 
a) has ( —2xy J’ 


and the change of the periodic solution of the second order dispersion function 
at the location of the sextupole slice is 


[1 NEL (DÈ — D?) | Bx cos (TV) 


ADS, 2 sin (114) sin (714) — o4 cos (114) 
ADya E k,dsD, D, By cos (mv) 
ADI sin (nv) sin (Ty) — ay cos (ty) J ` 


Plugging it into the 4D version of eq. (10.6), using eqs. (10.7) and (10.8) and 
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the similar relations for (I|y) and (I|b), we obtain 


Ang (9) = 5 [lA Doa + (a) ADi + (ly) A Dua + QI) Dj] 
1 ksds (D2 — D?) 

C  2sin(xv;) 

— [(1 — (a|a)) Ds + (x|a) D;] (sin (mvr) — az cos (1v4))) 
C s (fO) D, + (1 — uo) Dj] By cos (rwy) 

- [( — (880)) Dy + (vlb)D;] (sin (vy) — ay cos (4))] 


{[(a|x)Dz + (1 — (2|a)) Dz] Be cos (nve) 


Taking into account the fact that the vertical part in the curly brackets is the 
same as that of the horizontal part, we immediately arrive at the final result, 
which is 


1 C 
Arp? = ral ks (s) (D$ — 3D4D7) ds. 
0 


10.3 Longitudinal Dynamics 


Based on the previous two sections, we can construct the one turn map 
with the RF cavity present. From eq. (10.2), we can write the general form 
of energy gain 

AK (r,t) = qVo (r) cos [o (t)] . 


where ¢(t) = wt. For the TMo1o mode of a pillbox cavity, we have 


c 


V (r) = End (=) LT, 


where L, instead of l, is used to represent the length of the cavity to avoid 
confusion. Converting to the canonical coordinates, we have 


$() = do - 21, 


and 
Ko AK (r,l) — AK (0,0) 
sre Sea ee 
o) = RRO) Ko + AK (0,0) 
Ko qEoLT 


=; + —————————— 
Ko + qEoLT cos (ġo) Ko + qEoLT cos (o) 


|^ (=) cos (0 n "Ü — cos (v) 
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As shown in Section 10.1, the transverse focusing is negligible. Together with 
the fact that the change of velocity is insignificant, the cavity is simply a drift 
space for the variable x, a, y, b and l. As a result, we can treat the cavity as 
a thin slice with a kick in energy and the map of the cavity is 


Tf = Ti, af = — üi, 


P 
yp=u, b= b, =k, 
Pof 


-~ Ko +qEoLT cos (ġo) ^ Ko + qEoLT cos (do) 


2 2 
d [i cos (0 + zu) — cos v) 


Re 


Of 


where 


Poi = y (Ko + ME)? — meet, 


pof = V (Ko + qEoLT cos (do) + m2)? — mct. 


Obviously, the relative transverse momentum decreases as the particles are ac- 
celerated and so is the phase space volume, even though that for the variables 
(£, Px, Y, Py, -At, AK) is conserved. 

It is clear that when |óo| < 7/2, the reference particle is accelerated and the 
relative energy deviation of the particles at the vicinity decreases on average. 
Together with the rest of the ring, we have the one turn map, which is 


MT = MCAV o MAING, 


For the most general case, x, a, y and b are functions of 6 and l is a function 
of x, a, y, b and 6. As a result, the cavity couples the longitudinal degree of 
freedom to the transverse degrees of freedom. Yet, due to the large difference 
in oscillation frequencies which will become clear soon, the coupling is much 
weaker than that between the horizontal and the vertical planes. This is par- 
ticularly the case when the cavity is located in a dispersion free region, where 
coupling is limited to the nonlinear part of the map. In reality, it is common 
practice to place cavities in dispersion free regions to achieve separation of the 
longitudinal and the transverse motions. In the rest of this section, we always 
assume that there is no dispersion at the location of the cavity and ignore the 
chromatic terms in the nonlinear part of the transverse map (x, a, y and b). 
Furthermore, we ignore the spatial dependence of the accelerating field due 
to the fact that r « Re and the difference is second order in r. Consequently, 
the longitudinal degree of freedom is decoupled from the transverse degrees 
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of freedom and the longitudinal map is 


lg =1,+M, (ĉi), 
jain Ko qEoLT 
f Ko + qEoLT cos (¢o) Ko + qEoLT cos (ġo) 


: [cos (0 + Zi) — cos (40)| à 


ði + 


Next, we keep only the linear terms in the map and solve for the oscillation 
frequency. The linear map is 


ly =l; + (0|9)ó;, 
Ko 
ép ——À— 5 
! Ko + qEoLT cos (ġo) 
qEoLT w 


Writing in matrix form, we have 


iv Wis arik 
0g) \ Mor Mis) \ 6; 


where 
qEo LT ts 
Mac T v 
3! = Ko + qoLT cos (go) e (20), 
1 wW 
M E K ELT (l|6) si l 
Wu a e te Uem Con) 


It is easy to verify that the determinant of the matrix is 


Ko 
Ko + qEo LT cos (ġo) , 


which entails that the motion is non-symplectic when qEoLT cos($o) 4 0. 
Furthermore, the longitudinal emittance of the beam decreases as the beam is 
accelerated. It is obvious that the same is true for the transverse emittance, 
which is called adiabatic damping. By redefining the relative energy deviation 
as 


we obtain the new matrix 


EN. (a Q)YV(G 
bp) (My Mz) NJ. 
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where 
qEoLT w 
a — ja = sin (do) , 
1 QU. ER. us 
Mg; = = [Ko + aEoLT (1) sin (¢o)] , 
0 K 


which is symplectic. The trace is 


^ 1 UG. 
iM = 1 ue [Ko + gE gL T= (15) sin (¢0)| 


nig CRORE R iau. 
Ko K 

For (1|) > 0, tr M <2 if 0 < ġo < c; for (10) < 0, tr M <2 if -r < do < 0. 
In other words, for energy below transition, the synchrotron motion is stable 
if the synchronous phase $o € (—7,0) and, for energy above transition, the 
synchronous motion is stable if ġo € (0, 7). If we restrict ourselves to the case 
of acceleration, the stable interval of the synchronous phase is (—7/2,0) below 
transition and (0,7/2) above transition. This is the quantitative statement 
of the fact mentioned in the last section. Using the relation 


tr M = 2 cos (u), 


we have 
qEoLT 
2Ko 


It is worth noting that for most accelerators the relation 


Ht = arccos |1 — 


(—&) (1|) sin ($0) 


qEoLT 
2Ko 


(—5) (118) sin ($0) < 1 


holds. Making use of the fact that 


DE dv A E 
arccos x = arcsin y 1 — x^, 


we obtain 
arccos (1 — x) = arcsin 4/ 1 — (1 — x)? = arcsin (v 2x — 2?) =, V2e. 
As a result, we have 


| [qEoLT 


27h (qEo LT) nf" sin (óo) 
Ht = 
Ko 


(76) (1) sin (du) = of SE mtn). 


Note that h is the so-called harmonic number, which is the ratio of the RF 
frequency to that of the revolution frequency of the ring wo, and np” is the 
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first order phase slippage factor defined in eq. (10.3). The quantity p: is 
called the synchrotron tune which is proportional to the square root of the 
harmonic number, the accelerating voltage and the slippage factor. Usually 
the slippage factor is expressed in terms of Ap/po, as defined in eq. (10.4), 
denoted here as 77. Hence the relation between the two is 


Taking into account that j4 is the phase advance per turn, the synchrotron 
tune in terms of revolution per second can be written as 


me 2th (qEoLT) rj, sin (do) = h (qEoLT) n? sin ($9) 
"^ do Bo ome? i 2r f ome? 


which is the usual form that appears in most textbooks. For a circular ac- 
celerator with GeV level energy, the synchrotron tune w+ is usually between 
0.196 and 196 of the revolution frequency wo. The betatron tunes, on the other 
hand, are usually between a few to a few hundred times of wo. As a result, 
the coupling between the betatron and the synchrotron motions is usually 
weak. As an example, let us take a look at the storage ring of the Advanced 
Light Source (ALS) at Lawrence Berkeley National Laboratory (LBNL, LBL), 
California, USA, which is an electron machine that operates at the energy of 
1.9 GeV. The main purpose of the RF cavity is to restore the energy loss due 
to synchrotron radiation from bending magnets and insertion devices, which 
is on the order of 0.5 MeV per turn per electron. The harmonic number is 
328 and the slippage factor is roughly 1.4 x 1073. As a result, we have 


g, 328x LA x 10-9 x 0.5 = 
MEL ee ee ere eds 
X On x 3718 x 0.511 Sum 


Using eqs. (8.1), we obtain 


E qEoLT w 1 
Qt = - 2Kosin ju X (9) sin (ġo) zd at 
_ (is) _ Ws) 
sin Ju Le i 
u qEoLT w , pu 
— Ko sin p K sin (¢o) i (9) ° 


Since ug « 1, we have a; < 1. Consequently, the invariant ellipse is basically 
upright. For a given longitudinal emittance e+, the maximum bunch length is 


= -22y Beer =1 22 UUES 


Ht 
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FIGURE 10.5: Phase space plots of longitudinal motion with ¢, = 7/2 
(left) and ¢, = 7/3 (right). 


and the maximum energy spread is 


Ht€t 
max — 2 =; 2,/—_. 
Ù vitet —1 (15) 


Using the relation 


KC 
(l8) = —Kton?” = era 
we obtain ‘ 
P 22 2 Di 
lA ni Bo yonec 
bma =2 DL /LE TIN LA ? 
ee (za (qEo LT) sin =) 
and 


i = 9%, [& 2th (qEoLT) sin (¢0) 1 
cae KVC ni Ba^yomce? ` 


The main result is that lax « (rf y if other parameters remain unchanged. 
Hence, one way to reduce bunch length is to decrease the phase slippage factor. 
Fig. 10.5 shows the details of the dynamics of all amplitude for ¢, = 7/2 (left) 
and $, = 7/3 (right). Note that particles that are outside the stable region 
(called the RF bucket) lose synchronicity with the electromagnetic field in the 
RF cavity. For the case ¢, < 7/2, those particles would not be accelerated 
the same way as those inside the RF bucket and usually will be lost. 


10.4 Transverse Dynamics of RF Cavities 


Up to now, we have only treated the main effect of RF cavities, which is 
to accelerate (or, occasionally, decelerate) charged particles. We have shown 
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FIGURE 10.6: Longitudinal (solid) and transverse (dotted) field distribu- 
tion along the longitudinal axis. 


that, in order to accelerate the charged particles effectively, the synchronous 
phase has to be a stable fixed point, limiting the synchronous phase to just 
one quadrant (see Section 10.2). As shown below, this has significant effect 
on the transverse dynamics of linacs. In rings, the transverse effect of RF 
cavities is negligible due to the presence of magnets. 

Let us start with the longitudinal distribution of the electromagnetic field 
of a pillbox cavity with small holes at the center of each end, which will 
be derived from Maxwell’s equations. From eq. (10.1), we can express the 
longitudinal component of the electric field as 


E, (z,t) = Eo cos (wt + o) H (z + L/2) H (b/2 — z), 


where L is the length of the cavity and H is the Heaviside step function. From 
V- E — 0, we obtain 


10(rE,) OE; 
- =Q. 
r Or u Oz 
After integration, we obtain the leading order transverse component of the 
electric field, which is 


OE: 

, = LC = LB cos (wt + p) [6 (z + L/2) — 6 (z — L/2)]. 
2 Oz 2 

Fig. 10.6 shows the longitudinal dependence of E; and E, at an instance. 

From (V x B); = (1/c?) (OE; /0t), we obtain 


10(rBg) _ 1 OE. 


r Or c? Ot 
Similarly, we obtain the leading order magnetic field, which is 
r OF, wr 


From the Lorentz force law, eq. (1.1), we obtain 


dpr 
dt 


=q (Er m v; Bo) . 
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As a result, the change in transverse momentum across the cavity is 


L/2+e E 


—L/2—e Uz 
= -4mo | TL cos (wti +p) — 72 cos (wt + 2 
zl z2 
L/2 
qw : 
TIE, dzr (z) sin [wt (z) + qj] 
2c? —L/2 
q rı T2 
c — — Eo | — cos (wt + p) — — cos (wt2 + 
2c ° E Son) Bo We 9) 
Bi L/2 
+ — Eo dzr (z) sin [wt (z) + 9], (10.9) 
2c? —L/2 
where T1, Uz1; ti, £1 and T2, Uz2, t2, B» are values of T, Uz; t, B at z = -L/2 


and z — L/2, respectively. When the charged particle is non-relativistic, i.e., 
B « 1, the contribution of the magnetic field is negligible. Hence eq. (10.9) 
becomes 


Ap, — NC On Iz cos (wt; + v) — 72 cos (wta + v) 
2c | Ay Bo 


where tı = — f°, ja dz/[B(z)e] and t = f? dz/[8(2)c]. Note that the origin 
of t is set at the moment when the article is located at the center of the 
cavity. If we make one more assumption that |82 — £1|/8(0) « 1, which 
means that the energy gain (loss) through the cavity is much smaller than the 
total energy of the particle, we obtain that tı = —L/(2Go9c) and t9 = L/(2foc) 
(Bo = 8(0)). Consequently, we obtain 


"T" LE 
ons JP. VE t e)-3 A jus cz e) 
ig TL TL 
3 i (2x-9)-5 y (25*9)] 


Let us take a look at drift tube linacs as described in Section 1.3.2. Phase 
stability requires that —7/2 < p < 0 (see Fig. 10.3). Efficient use of 
energy requires that the particles are accelerated throughout the gap, i.e., 
cos (v — TL/ (89À)) > 0 and cos (p + *L/ (891)) > 0. As a result, the par- 
ticle is focused at the entrance of the gap and defocused at the exit, as 
shown in Fig. 1.13. Furthermore, we note that cos( — *L/(89A)) > 0 
entails that y — v L/ (8A) > —7/2, which leads to «*L/ (8A) < Y + 1/2 
and y 4 mL/ (BoA) < 2€ + 7/2. If —4/2 < p € T/4, p+ 7L/ (BoA) < 0 
and we obtain that cos (p + vL/(foA)) > cos (p — vL/ (BoA)). If —7/4 < 
yp « 0, p+7L/ (foX) can be positive where cos (p + 1 L/ (89A)) decreases as 
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p + 1 L/ (Go) increases. From the fact that 


+B - e deer o) 
POMA AND gl) RA eer ee 


we reach the same conclusion that cos (v + 7 L/ (894)) > cos (p — tL/ (892)). 
Although we have 6; < 65, yet 85/01 is usually much smaller than cos(q + 
7 L/(£9A))/ cos(y — 1L/(8oA)). As a result, the net effect is defocusing. The 
remedy in the early dates was metal foils or grids placed on the entrance of 
the drift tubes (exit of the accelerating gap) to remove the defocusing force 
as shown in Fig. 1.13. Nowadays, quadrupole magnets are placed inside the 
drift tubes to provide transverse focusing. 

At higher energy, the particles become relativistic and the contribution of 
the magnetic field has to be taken into account. Yet the matter is simplified 
somewhat by the fact that the difference between 3, and £5 can be neglected. 
Assuming also that r remains a constant in the cavity, we obtain 


qr TL TL ) 
Ap, = ——— Ep |cos | —~ — — cos | —— + 
uU | (25 e = i 


qrEo . ( aL ) á 
Ee sin | —— | sing 
Borge Bor 
"nqEoTLr , 
———— — sin q. 
Bici Er 


Again, for the stable phase of —7/2 < y < 0, the net effect is defocusing. It 
is worth noting that, for ultra-relativistic particles, this effect goes away. As 
Bo — 1, the defocusing from the electric field is canceled by the focusing from 
the magnetic field. 


Chapter 11 


*Resonances in Repetitive Systems 


Unlike single pass systems, the dynamics of the beam in a repetitive system 
such as a storage ring is not necessarily dominated by the largest aberrations 
in the one turn transfer map. Due to the fact that particles go around many 
times, the impact of those terms that are nearly in phase with the linear 
motion are amplified and the dynamics is, to a large extent, shaped by 
them. The motion generated by one of those terms is called a resonance. As 
a result, we need a different way to evaluate the relative significance of the 
aberrations that is directly suited for rings. 


Since resonances appear in various physical systems, many different meth- 
ods have been developed to describe this phenomenon. Here we have adopted 
a method that is based on the map method that has been developed here and 
does not require advanced techniques such as normal form theory as in [5]. 


As shown in the previous chapters, a large class of single pass systems 
are imaging. Yet the entire ring cannot be imaging, since it will be linearly 
unstable if |M| Z 1, where M is the magnification. Moreover, the ring is 
unstable with the presence of arbitrarily small errors when |M| = 1. 


11.1 Integer Resonance 


In this section we will study the dynamics in a ring when one or more dipole 
magnets have errors in the field. Let us first consider the case that one magnet 
has a dipole error in the field, which is AB. Without lost of generality, we 
adopt the thin lens approximation, since a thick dipole can always be cut into 
a number of thin slices and the contribution of the whole is the sum of each 
slice. The kick resulting from the error is ABl/Bp. Thus the position and 
angle after one turn is 
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where M is the one turn map of the ideal ring and the kick happens at the 
end of one turn. Similarly, the position and angle after two turns is 


(2) = Captian) ** (2) 
zx Ca +M mr) SMS (2) 
= (f+m) [nnde + MP E 
Now let us assume 


Xn—-1| _ (F y m ^rn—2 0 ^rn—1 [ XO 
(Set) = (ir +M ) (anig) +% e. 


and we obtain 


2(1 — cos p) 


1 1 —cosnp — asin npu —Bsinny 
ysinnu 1 — cosnu + asin nyu 


1 — cos u + a sin u DB sin 
l —ysin u 1 — cos u — asin u 
= sin(nu/2) ( sin(nu/2) — acos(np/2) — cos(np/2) 
sin(u/2) "y cos(nu/2) sin(nu/2) + a cos(nu/2) 
l sin(u/2) + a cos(u/2) B cos(u/2) 
—ycos(u/2) sin(u/2) — o.cos(u/2) 


sin(nu/2) e-d. 
sin(u/2) 


Note that 


ap - G u(n—1)/2 + asin u(n—1)/2 B sin u(n—1)/2 


—ysin u(n—1)/2 cos u(n—1)/2 — asin u(n—1)/2 
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When u — 27k, sin(nu/2)/ sin(u/2) — n. The result is that the motion is 
divergent in phase space and eventually the beam will be lost. In other 
words, the particle is called in resonance when u = 27k for some k. Since 
errors in dipole magnets are unavoidable, the only way to avoid this resonance 
is to adjust the tune away from any integer. 

In theory, the phase space coordinates become arbitrarily large only when 
the tune is infinitely close to an integer. But in practice, the finite size of 
the beam pipe defines a finite interval around an integer such that the beam 
will be lost, which is called the stop band. In order to determine the stop 
band of a given machine, the size of the beam pipe and the dipole errors of 
the magnets have to be known. The source of dipole errors can be either 
the field errors in the dipole magnets and dipole component generated from 
misalignment of multipoles (quadrupoles, mainly, and sextupoles to a lesser 
extent). Since the dipole errors act on every particle the same way, it is 
sufficient to study the motion of the so-called centroid of the beam only. In a 
ring, this centroid, which is called the closed orbit, is the periodic solution of 
the one turn transfer map. With the presence of dipole errors, the one turn 
map is not origin preserving. Because the distorted closed orbit is usually 
close to the design orbit, only the linear part of the map is included in the 
treatment. The periodic solution of a single error at sọ is obtained through 


the equation 
(2) = (azn to) 2o) * C): 


As a result, we have 


Fo z (r- 8) buen TN 


_ (ABI) (so) 1 B cos(1/2) 
~ Bp 2sin(u/2) V sin(u/2) — acos(u/2) J ` 


From eq. (6.10), we obtain the closed orbit at an arbitrary location s 
Leo(s) = vy Bs / Bso (cos ó +asin $) to + y bs bso sing ao 


ABI 1 z n 
= CA LE [VB Bo (cosġ + asosin $) Bso cos 5 


+v Bs Bso sin ó (sim 5 — Aso COS £)] 
_ (ABI), cos (nv — d) 
= Bp Seine)” 


where ġ, Bs, Bso, eo and (ABI).o represent ó(s) — ó(so). B(s), B(so), a(so) 
and (ABI) (so), respectively. When n dipole errors are present, the closed 
orbit is 


(11.1) 


1 ? (ABI) (5m) 


Leo(s) = 2sin(u/2) — BP 
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11.2 Half-Integer Resonance 


In this section we study the effect of quadrupole errors. Again, let us 
first assume that only one quadrupole has an error in the field gradient, which 
is denoted as Ak. Without loss of generality, we assume that this quadrupole 
is located at the end of a turn and that it is thin. As a result, the linear one 
turn map is 


X1 = 1 0 o X0 
(2) = Can 1) 8 (a) 
p3 1 0 cos p. + asin u sinp To 
~ (Aki 1 —ysinu cosp — asin y ao j` 
In order to simplify the calculation, we adopt a new coordinate system such 
that the unperturbed motion is a rotation. As a reminder of the general 
theory of transformation, let us assume that a matrix A transforms 7 into y, 


i.e., 7 = ÂZ. Assuming another matrix M is the linear one turn map in the 
space of 7, which means that 


Zı = Mi. 


Multiplying A to the left and inserting A-!A between M and zo, we obtain 


and hence a n 
= AMA gp. 
It is straightforward to verify that 


a (uB 0 
a//B vB 


1/V/B 0 \/cosu+asiny sinp VB 0 
aA B VB —ysinu — cosu — asin uj A —oa/ /B 1/A/B 


BRL 


— sinu cosp 


implies 


AMA = | 


'The relation between the coordinate systems is 


NM 


Resonances in Repetitive Systems 265 


The one turn perturbed map in the new coordinate system is 


m ME peal ee aioe © ae SU 
EN EA o) “(Akl 1 de a 
1 0 ^ To 
- (12) (8) 


where AK above is defined as AK = AkIf. After n turns, the coordinates 


——— (ese) 


In order to illustrate clearly the nature of the dynamics, we treat the problem 
perturbatively. To the first order, the coordinates are 


(RD) + (_ n) CIENT (_ n» j| (Ro) 
be (à) 2 ;Jito 
3 L Co do m mon 


o3 bed ud m)u] ic eei] G 


cos mj cos [(m — m)u] | cos mp sin [(n — m)y] 


= [Rem - AK ( 4 g ) EU [5 FE 


cosny sinn cosnu sinnu 


_AK e ,sin[2m— nu] ER cos [(2m — my) 2) 
(2. NEZ cos [(2m — n)a] - 325 sin [(2m — n)u] 


In order to simplify the expression further, we take advantage of the trigono- 
metrical series 


< " . sinfe + y(n — 1) /2] sin (yn/2) 
5 sin (x + my) = E tu 


ve | Cos [z + y (n — 1) /2] sin (yn/2) 
5 COS (a T my) == O  sn(y2 - 
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Specifically, we have 


5 sin (2m — n)y] 
= Y sin [2 (m—1)p— (n—2)4] = Y^ sin[- (n — 2) n 
m=1 m=0 
_ sin — (n—2) w+ 2u (n — 2) /2] - sin (n — 1) ui] = 
sin u f 
and 
Y cos [(2m — n)u] 
= Y cos [2 (m -15-(n-2)4]- Y^ eos[- (n — 2) p+ 2g] 
m=1 m=0 
_cos[— (n= 2) w+ 2 (n— 2) /2}-sin[(r— 1) pn] _ sinfin- 1) 4) 
sin yu sin u ` 


Hence the final result is 


En A 0 0 
( J- iow) - a ( ; ) 
ün cosnu sinnu 
AK sinnu —cosnpy AK sin|(n — 1)u] [ 0 1 To 
— —— (n — 1) . — — M EE 
2 cosnu sinnu 2 sin u 10 ao 
When u — 2rk or 2z(k + 1/2), sin[(n — 1)u]/sinu — n — 1. Hence the 
resonance is called half-integer resonance. 
When the tune is a certain distance away from the half-integer, the motion 
is still stable, but the invariant ellipse and the tune change. To obtain the 


perturbed invariant ellipse and tune, only the one turn map is needed. From 
the one turn matrix 


Kn = 1 0 cos p + asin u sinp 
|»OX—AKl 1 —ysinų — cosu — asin p 


cos u + a sin u DB sin 
-~ \—ysin u — (AK/8) (cosu +asin u) —AK sin u + cos i — asin u J^ 
we obtain 


2 cos (u + Au) = 2 cos u — AK sin p, 
cos (u + Aw) + (a + Ao) sin (u + Av) = cos u + asin p, 
(B + AB) sin (u + An) = B sin p. 
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As a result the changes are 


( AK . ) -sinu AK 
Ap = arccos cosu — -y sin — p= —— mL —— 


—sinu uu 
cos u + asin y—cos (u+ Aqu) (a + AK/2)sin u 
sin (u + Ap) sin u cos (Ap) + cos u sin (Ap) 
AK AK AK 
=, | a + — 1— ——cotu| — a =; ——(1- acot u), 
2 2 2 
Bsin p Bsin p 


~ sin (ut Ap) | a sin pcos (Ap) + cos usin (Au) B 


AK AK 
= 6 (1-5 cot) - 9 = - P atin 


where we remind ourselves of AK = Aki. The final relations are 


AK 
Au =1 >> 
AK sin u — a cos u 
Aa a Oer —= = 
2 sinu 


Af _ AK cos 
B | 2 snp 


It is worth noting that the invariant ellipse becomes infinitely large when 
v — k/2. Like the closed orbit, the invariant ellipse is the periodic solution. 

Now let us extend the calculation to multiple errors. The one turn matrix 
is 


^q ow, 1 0 ^ 1 0\ » 1 0\ n 
M = a c. 1) (_ aia, 1) te (_ (ann, 1) o 
where, from eq. (6.9), 

es of fe WO dm 

MS Cvm Wy) iid we a) | 


The factorization gives us a way to simplify the calculation by working in 
the normalized space where the matrices between the kicks are rotations. 
Specifically, we have 


M= vbo ] ae ; VVB 0 
—ao/ V/Bo 1/v/Bo ao/ v/Bo v/Bo ] 
and, denoting AK,, = (Akl) mbm, 


M = B(bno) (ke, » +++ R(d12) ee 1) Ré). 
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To the first order of the errors, the one turn matrix becomes 


=a 5 A | sin mo COS Pom sin dmo sin dom 


COS dmo COS Pom COS dmo Sin dom 


n ds sinu —cosuV 1X SIN [lm COS Üm 


cosu sinp COS [lm — SİN fim 


where j denotes 


The change in tune can be obtained through the relation 


1, (3 u 
cos (u + Ap) = 3 tr U ) = cos — l 5 AK, sin 4, (11.2) 


m=1 


which is, to the first order of Ak, 


If the change in focusing is due to the difference in momentum, this equa- 
tion gives the chromaticity. Since the tune is changed due to the presence 
of gradient errors, one important question is how far away it has to be from 
an integer or a half-integer in order to maintain stability for a given set 
of errors. This interval in the tune space that the motion is unstable is 
called the stop band. Assuming that the errors are small, the stop band 
Ap is small, too. As a result, the unperturbed tune can be written as 
u = 2n(p+e) or p = 21 (p--1/2-- e), where e is small. Therefore, we 
have sinu =; +2re. Plugging it into eq. (11.2), we find that the term 
— (0/2) - 555,4 A Ks] sin u =2 Fre >" _, AKm, which is a second order one. 
Hence we have to go one step further to include the contribution up to the 
second order of Ak. The one turn matrix in the normalized space is 


E ee sinu — cos pL ee sing%  cosj 
M =u- 3 YAK (oe sinu -3 2 AKm cos fi — sin ji 


m=1 m=1 


+ Y) AMAK mR m0) (10) Rm) (1 0) R60 


l,m=1,l<m 
-R( ISAK sinu — cosu SISAK sing cosi 
TORUM : ™\ cos sinp 2 ™ \ cos — sini 
m= i 


ne Y Aki AK», 


l,m=1,l<m 


| sin dmo sin d1mCO8 io sin dmo sin dim sin G10 


COS mo Sin $j,,COS Pro COS Omo Sin dim sin Qio 
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The trace of the matrix is 


~q n TL 
tr (a1 ) = 2 cos u — » AK, sin u + y AK AK, sin dim sin(u — Qim). 


m=1 l,m=1,l<m 


To the second order, the trace is 


xa n 
tr U ) — +2 i Dre? = re 5 AK, 


m=1 


n 


+ OM AK AKm sin fim sin (21v — dim) - 


l,m=1,l<m 
The last term can be simplified, which is 


S AKIAK,, sin dim sin (21v — Gm) 


l,m=1,l<m 


= I 5 AK AK, [cos (2015) = 1] 


l,m=1 


II 


1 n 
4 5 AK AK, cos (27) cos (2Y m) 


l,m=1 


Ha 2 AK AKm sin (201) sin 2.) — 7 2 AK AK 
2 


Y AK, sin (2Vm) 


m=1 


5 AK, cos (2Y m) 


m=1 


1 
4 


2 T 2 
ES bs val | 
m=1 
where V,, is the phase at the mth quadrupole. For the integer stop band, we 
have 


tr (s) =2 


2 


2 AK, cos (2Y m) 


1 — 277? — re 5 AK, 


m=1 


IHE] 


The unstable region of the tune is determined by the relation 


"E abs AK, sin (2Vin 


m=1 


n 2 
Ine? + me 5 AKm- -= 
m=1 


5 AK, cos (2Y m) 
m=1 


2 
1 n 
+5 |X AK yy sin (2U;n) 


m=1 


<0. 


" 2 
= |X AKm 
m=i 
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Denoting 


2 


2 


Y AK, sin (2¥m) 


m=1 


5 AK, cos (2Y m) 


m=1 


+ 


, 


we see the unstable interval of the tune is 
Au 


< ; 
2 


1 TL 
2 = AK, 
T€ + 2 2 


and Au is called the integer stop band. Similar calculation shows that the 
same expression also gives the half-integer stop band [16]. 

In order to obtain the change in the invariant ellipse, we have to go back 
to the original space, where 


Nn = vV/Bo 0 rae 1//Bo 0 
=20/Vbo l/v/Bo oo/VBo VBo 
B e a Bosin u ) 


—yo sin u COS u — Qo Sin u 


lie sin | — Qo cos H — bo cos u 
-3X ak. ( 


A Yo COS u sin u + Qo COS u 
m= 
E sin Ji + ao cos [t Bo cos ji 
m 2 bs AK Ta 2 d 2 2 inn E e]: 
m=1 ( aĝ) cos [1/ Bo + 20:9/ Bo sini — Qo COS LL 


Immediately we have 


cos (u + Ap) + (ao + Aq) sin (u + Ap) 


i< 5 3 
= cos u + a sin y — 5 2 AK, [sin u — ao cos ji + sin fi + oo cos [1] , 


m=1 


(Bo + AB) sin (u + Ap) 


; 1X m 
= Bosinu — 5 ber [— o cos u + Bo cos ji] . 


m=1 
To the first order of Ak, we have 


1 


Aa = —-—— 
x: 2 sin u 


n 
5 AK, [sin im + ao cos ft] , 


m=1 


n 
y AK m COS fm, 
m=1 
and we remind ourselves of fim = H — 2¢om and AKm = (Akl) m Bm. It is 
clear that when the tune is close to the half-integer resonance, the size of the 
beam becomes larger and eventually goes to infinity as the tune approaches 
the half-integer. 


A01 
fo  2sinpg 
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11.3 Linear Coupling Resonance 


Linear coupling refers to mixing of linear motion between the horizontal 
and vertical planes. Coupling between transversal and longitudinal motion is 
present as well, but it is usually weaker. Linear coupling between transversal 
planes arises from solenoids, roll of quadrupoles, vertical misalignment and 
orbit offset at sextupole locations. In this section we study the effect of skew 
quadrupoles, which can be present intentionally or due to the roll of normal 
quadrupoles. Again, let us first assume that only one skew quadrupole is 
present, which is denoted as k,. Without loss of generality, we assume that 
this skew quadrupole is located at the end of a turn and that it is thin. As a 
result, the linear one turn map is 


X1 1 0 0 0 To 
a1 = 0 1 ksl 0 A ao 
Ui B 0 0 1 0 M Yo : 
bi kl 0 0 1 bo 
where 
COS[La d- oz SIn Ug By sin Ly 0 0 
^ —Yzx SİN Hy ^ COSA; — Or Sn pus 0 0 
M= S à 
0 0 cosu,--oysinpg, By sin Hy 
0 0 —Yy SiN fy — cosuy — AySinfly 


Applying the same coordinate transformation, we obtain the normalized co- 
ordinate system, which is 


a Ox / V Be V Bx 0 0 a 

y 0 0 1/yB, 0 y 

b 0 0 ay/V/By V By b 

As a result, we have 

Ti 1000 COS Ux Sin [Ly 0 0 To 
ay _{[ 01k 0 — SİN [ly COS [ly 0 0 ao 
h| | 0010 0 0 cosy sinply | | Vo |’ 
bi K001 0 0 —sinpy cospy/ \ bo 


where K above denotes 


K eds ay. (11.3) 
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To shorten the equations, let us use again the following symbol 


COS [lz SİN Hy 0 0 
A Ge — SiN Uy COS Ly 0 0 
R = , 
Hy 0 0 COS Hy- SiN [ly 
0 0 —Sin fly COS [ly 


After n turns, the coordinates are 


TL 


du 1000 do 
Gn | 01K 0 iE) Go 
Yn 0010 My Yo 
Dn K001 bo 


To the first order, the coordinates are 


Tn 0000 
ün " 00KO0]|. 
5 |^ s)" |) 
Un Da 0000 Ta, 
b, K000 
0000 
; rst A Oils. E 
+A(M) ae Debes 
Hy 0000 (n — 1)uy 
K000 
0000 | To 
A = 00K 0 a 
(ec) a 
(n—1)py/} 00 0 0 Hy Yo 
K000 bo 
| 0 0 0 0 
_ | 6 (Nb 0 0  cos(npuy) sin(npuy) 
= a(t) 4K 0 0 0 0 
COS(N Uz) Sin(Nftx) 0 0 
00 2 b 
j Run a 
n 00 d 
+K P IE 
2. A 00 Yo 
m | Rox 
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where 
Ru, = (310 mus] sin (mpy) sin((n — m) pr] sin (mn) 
cos [(n — m) pa] cos (mj) — sin [(n — m) us] cos (py) J” 
hyp c {C09 = m) pulsin (mis) sin[(n — m) py] sin (mns) 
cos[(n — m) uy] cos (mps) — sin [(n — m) py] cos (mpe) J 


The nonzero terms in the transfer matrix can be further simplified using 
trigonometric relations. Therefore we have 


: cos [(n — m) tz] sin (muy) -E esl [((n — m) fea] Sin (muy) 

= ES ipu, = m (pie = ny) — sin [rte — m (us ny) 
m=0 

EU sin [n (ps — ny) /2] 

= 5 mueren | rr 


-sin m =i (n-1) (He uS má 


sin [(us + Hy) /?] 


EE 


sn (He + n) sin E (us — n) 


sin [n (us + Hy) /2] 
sin [(us + ny) /2] 


En (un = n) | sin [5n (ue i § 
= I {sin [z^ (Ha + n) cos E (us — n) TEM 


— sin E (us — n) cos E (Ma + 7) YER — sin (rn) 


| 
— sin E (He — i) cos E (Ha + i) 
| 


When py > +f, +27N (N is an integer), we obtain 


1 
3 cos [(n — m)us] sin (muy) > 5 (n — 1) sin (nyy). 
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Similarly, we have 


> cos [(n — m)us] cos (mq) RT cos [(n — m)us] cos (mj) — cos (niis) 


= feos E (us + )| cos E (us — ») ae e 


+ cos E (Hz — n) cos E (Ha + )| Ce) 


— cos 3n (e + | cos |n Qs = n) ; 


so we obtain when py > cp; + 27N, 


een, 


1 
p cos [(n — m)us] cos (mu) > 5 fo — 1) cos (nuy) + EF 
y 


Furthermore, we have 


£ sin [(n — m)uz] sin (muy) = 5 sin |(n — m)uz] sin (muy) 


sin [n (us — Hy) /2] 
sin (us — My) /3] 

EIN 
sin (us My) /2] f 


i 1 {cos [ente] eos [3e — 7] 


+ cos [z^ (us — 7 cos E (us + n) 


and we obtain when jt, > cus + 22 N, 


see DAY 


sin fly 


1 
3 sin [(n — m) uz] sin (muy) > ES te — 1) cos (ni) — 
Lastly, we have 


2 sin |(n — m) uz] cos (muy) 22 sin [(n — m)us] cos (mj) — sin (np) 


= {sin [z^ (He + n) cos E (Ha — » 


sin [n (us — uy) /2] 
sin [(us — Hy) /2] 
EIE NNI 


+sin E (us — p) cos ls (uz + )| sin [(us + uy) /2] 


and we obtain when pry > cus + 22 N, 


1 


x sin [(n — m)u;] cos (mu) > + T5 (n — 1) sin (npuy). 
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The upper right block of the matrix can be obtained by switching py and 
Ly Of the results above. In summary the transfer matrix at the limit of 
Ly > Hg + 27N is given as 


E E ds 1 A 
in = fan + K [in + ses sn as 


2 
where 
Tn To 
Gn ao 
Zn = ~ , Z9 = ~ , 
Un Vo 
b, bo 
cos (Nuz) sin (nuz) 0 0 
— sin (Nuz) COS (Nux) 0 0 
0 0 cos (nuy) sin (nyy) | 
0 0 — sin (Nuy) cos (npuy) 
0 0 0 
0 cos(nu,) sin (nuy) 
0 0 ] 
cos (nz) sin (nj) 0 0 
E Sg 
ò Sr 0 
+s 0 0f 
0 0 0 
E 0 sin (Nux) F cos (nj) 
^l 0 0 COS (Nus) Esin (nj) 
MA sin (Nuy) F cos (ni) 0 0 , (11.4) 
cos (ni). x sin (nuy) 0 0 


and the coupling terms s; and s, above are 


sosm(n-D44 — Q0 sin (n= 1) py] 
D sin Ji i ? sin [ly 

indicating that an arbitrarily small perturbation leads to arbitrarily large 

coupling between the horizontal and the vertical spaces. 

Similar to the half-integer resonance, the presence of the skew quadrupole 
component leads to a stop band gap in which the beam becomes unstable. 
'To illustrate this point, let us first work the formalism of the eigenvalues of a 
4 x 4 symplectic matrix. 

Before we treat linear coupling, let us look at a few general properties of 
symplectic matrices and their eigenvalues. Recall that the symplectic condi- 
tion is 

MT jM = Jj, 
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where M is a 4 x 4 real matrix. The eigenvalue of M can be obtained through 
solving 


det (7 > AD E (11.5) 


Since M is real, 


Mi = A? Mis = Atot Mit = Mt. 


So A! is also an eigenvalue of M and @ is the eigenvector. Keep in mind that 
det(AB) = det(A)det(B), det(ÀT) = det(A). 
With these identities, we can transform eq. (11.5). 


det (I — Af) = 0 = det F- det(A — M) = 0 = det(J M — AJ) = 0 
= det MT . det(JM — AJ) = 0 => det(J — AMT J) = 0 

= det(Î — AMT) = 0 => å?” det (A71 Î — MT) = 0 

—5 det(MT — A711) = 0. 


Therefore A-! is also an eigenvalue. Together with At, we reach the con- 
clusion that if M has an eigenvalue A then At, A71, \1~! are also eigenvalues. 
We know that for M to be stable, |A| has to bs smaller or equal to 1 for 
all eigenvalues of M. As a result, all eigenvalues of M lie on the unit circle. 
Apparently, we have, for this case, A = A!-! and A7! = Af. 

The next question we can ask is that supposing M is stable and every 
eigenvalue is reasonably far away from each other, what happens when M 
is perturbed? In terms of betatron motion, it means that uy and py are 
reasonably far away. 

To frame it in a more mathematical way, we say that there is a neighborhood 
around each eigenvalue so that there is only one eigenvalue in it. When M 
is perturbed its eigenvalues will move. They may all stay on the unit circle, 
or some of them may move away from it. Let us say one of them, A, moves 
away from the unit circle, then \'~! will also move away from it and be in 
the same neighborhood that A is in, making the total number of eigenvalues 
greater than 4, which is impossible. So the conclusion is that every one of 
them will stay on the unit circle (see Fig. 11.1). This is consistent with our 
experience, which is that when we change quadrupole strength, zz and ju 
change, but the motion is stable. 

The question that relates to linear coupling is what happens if two or more 
of the eigenvalues get close to each other? The answer is more complicated. 
It has been proven that when two colliding eigenvalues have the same sign of 
phase, motion remains stable after collision. Otherwise, instability may occur. 
In other words, difference resonance jig — Hy does not lead to instability, sum 
resonance [lz + py does. 


Resonances in Repetitive Systems 277 


FIGURE 11.1: Distinct eigenvalues around the unit circle. Small circles 
show the neighborhood of each eigenvalue. 


_ Let us focus on coupling between horizontal and vertical planes. The matrix 
M is a 4 x 4 symplectic matrix, which we describe as 


; AB 
M = A ^ 
(6 5) 
where A, Ê, C and D are 2 x 2 matrices. From eq. (5.11), we have 
^ AC 
e. A U 
e 


where A is defined as A = CAP AT as well as B, C and D. Let us solve for 
the eigenvalue A of M + M~!. Obviously A = A + 1/A, because 


Note that 


ALAS A +( Q22 —012 


Q21 422 


Using the relation 
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we proceed and obtain 


ae (rein af) = ao: (57 z EU X 

C+B (tr D — A)- T 
— det [tr A —A)(tr D - AM — (tr À -AM(Ó + By((tr À - AM)" (B + c 
— det [(tr A - Jtr Ô- ay - (6+ BB 4- €) 


To find A, we have to solve the equation 


det [tr A -Atr D — A)- f — (64+ B)(B4 €) - p. 
Now we have to find out what (C + B)(B 4- C) is 
É ^ bii b12 C22 —C12 bii d 622. big — c12 
B+C= + = 
D s a n aa) Us —€n b22 +11 ) ( 
^ D C11 C12 b22 bs Cii d 023. C12 — b12 
C+B= + = = 
T f E È ba bu ) — b21 C22 + b11 ) @ 


We have just shown that 


C+B=B+C. 
Then we obtain that 
(Ó + B)(B + C) = det(Ó + B) - f. 
So the equation for solving A becomes 


(tr À — A)(tr D — A) — det(Ó + B) 


I 
2 


Then 
A? — (tr Â + tr D)A + tr Â- tr Ô —det(C + B) = 0 


i A Â — tr Dy? ES 
(tr A tr D) T acu + det(C + B) : (11.6) 


= Å= 


NI = 


This is the standard result of coupled motion, where the eigenvalue of the 
motion in one plane depends on the motion in the other plane and vice versa. 
The main features of eq. (11.6) are that Ay > tr Á when tr A > tr D and 
A, > tr D when tr À < tr D; and that A- > tr D when tr A > tr D and 
A_ — tr À when tr À « tr D. 

Courant and Snyder [16] have shown that the sign of det(C 4- B) determines 
the stability of the motion when tr À c tx D. Specifically, when pig — ply = 27N 
(difference resonance), det(Ó + B) is positive, A is real, A is on the unit circle, 
so motion is stable. When ps + jj, = 27N (sum resonance), det(C + B) is 
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Ha 


Ma Hy 


Hy 


FIGURE 11.2: Crossing the difference resonance. Before: juz is constant 
and uy increases. After: fz and py reverse roles. Throughout the process, 
the ratio of the knobs (quadrupoles in synchrotrons) remains unchanged. 


negative, A can be complex, A moves away from the unit circle, so motion is 
unstable. 

Let us now look at the difference (ps — Hy = 27N) and sum resonance 
(Liz + Hy = 21 N) separately. In case of difference resonance, we have 


tr A — tr D)? ie. E 
Ny = 59 (AED + det(C + B), 
A, = ei» + e I^» = 2c08 ps 
tr À — tr D)? re 
=> COS Uy — COS Hy = acu + det(C + B) 
tr A — tr Dy? P 
— (cos, — cos)? = Giu + det(C + B). 


As a result, there is a minimum separation between the tunes when the 
difference becomes small, as shown in Fig. 11.2. The minimum separation of 
tunes can be used to determine the amount of coupling in an accelerator. 


In case of sum resonance, motion becomes unstable when 


tr Â — tr Dy das 
CAED) c ae B) 
It is illuminating to compare the result above with that of eq. (11.4). There 
both sum and difference resonances show unbounded growth in coupling, yet 
only the sum resonance leads to instability in the 4D phase space. 


To illustrate this, let us follow Courant and Snyder [16] and work out the 
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one turn map when a skew quadrupole is present. 


10 0 0 COS Ug SIN [Ly 0 0 
M- 01K 0 — SİN Hy COS Hy 0 0 
0010 0 0 COS fy SiN Hy 
K001 0 0 —sinpy COS [My 
COS [lz — Sin Hg 0 0 


—sinu,;  cosu, Kcospy K sin py 
0 0 COS uy iunge 
Kcosu, Ks, —sinfly COS py 


B=K/( 0 v J: e-«( j my j 
COS [ly SiN [Ly COS [lz SİN [Lz 
and 


ĉ+B=x|( 0 0 )+( sin [ly o) -«( SIN [ly 0 ) 
COS [ly SİN Ly — COS fy 0 COS [lz — COS fy SÌN [Lz 


Furthermore, we have 


Hence 


det(C + B) = K? sin pz sin py. 


For sum resonance, we have sin jj = — sin py and det(C -- B) < 0. The motion 
is unstable. When there are many skew quadrupole components in a ring, to 
the leading order and near the sum resonance, we have 


det(Ó + B) = — sin? u Se K2, 
m=1 


where we use the abbreviation Km = (kl) / sys as eq. (11.3). The 
stop band is 


n 
(cos Hæ — COS fly)” < sin? u 5 Ke, 
m=1 


and the full width, to the leading order, is 


For difference resonance, we have sin yy = sin uy and det(C + B) > 0. The 
motion is stable and the minimum tune difference is 
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11.4 Third—Integer Resonance 


In this section we deal with the third-integer resonance, which is generated 
by the sextupoles introduced into the ring to compensate for chromaticity. 
Similar to the previous section, we will start with a particle with an arbitrary 
position and angle and demonstrate the resonant behavior when the tune 
is close to the resonance. Again, let us first consider the case that a thin 
sextupole is located at the end of the ring. The one turn map is 


X1 on HH o XO 
im) (e) (2) 
= T o | ( COS + asin wu Bsin p £o 
~ a H ksr? —ysinu cosu— asinu ) Vao J|. 


In the normalized space 


aaa 


ž : Zo 
Guo) ro] 


The position and angle after n turns are 


Pa i mo (u) (=) oak P 


To the first order of ks, we obtain 


(=) = (wy (2) 


m=0 


R 


wE) 
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_ | Zo cos (np) + Go sin (ny) 3 0 
| —Zo sin (nw) + Go cos (nj) TE | [Zp cos (np) + ŭo sin (np)? 


"T 2 eo a ee el 


{Zo cos (n — m) u] + Go sin [(n — m) u]. cos (mp) 


|. | cos (ng) + ao sin (ny) à 0 
2 B —Zo sin (nj) + do cos (nj) TOP | [Zo cos (nj) + Go sin (n)]^ 


ug e E (nm 
S MEE ITA = \ cos (mp) 
n—1 [ 
l 


2n — m) u] — sin [(2n — 3m) u] 


=, X cos [(2n — m) u] + cos [(2n — 3m) u] 
cos [(2n — 3m) u] — cos [(2n — m) u] 
E n Bf ado 3 sin [(2n — m) u] + sin [(2n — 3m) u] ) l 


Again, using the formula 


3 cos (z 4- my) — seen E, oe wis eeu) (11.7) 
we obtain 
3 cos (mp) = X cos [u + (m — 1) u] = 3 cos (u + my) 
m=1 m=] m=0 


_ cos [u + (n = 2) n/2]| sin|(n — 1) u/2] _ cos (nui/2) sin [(n — 1) u/2] 
sin (u/2) sin (u/2) 


Y ex[Q-m)u- Y ex[G2n- 1) (n1 


B _ cos (3nu/2)sin [(n — 1) u/2) 
= x cos |(2n — 1) u — my] = t eR - vi 


n—1 
Y; cos [(2n — 3m) ui] =F el 2n — 3) n — 3 (m — 1) u] 
m=1 


n—2 


= = M cos[( [(2n — 3) u — 3my] = 


m=0 


cos (nu/2) sin [3 (n — 1) u/2] 
sin (34/2) f 
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From eq. (11.7), we have 
n-1 = 
Y sin (z + my) = ¥ cos (r-+my— 3) 
d cos [x + (n — : Ja —n/2|sin(ny/2)  sin[r + (n — 1) y/2] sin (ny/2) 
in (y/2) sin (y/2) l 
As a result, we obtain 


n—1 n—2 


X sin ( (iu) ET [w+ (m— 1)u] = Y sin (u + mp) 
m=1 m=0 
_ sin (nu/2): sin [(n — 1) u/2] 
sin (u/2) 


V. sia (Onn) i] = S sm[On cede Gad 


- Ẹsin l@n = nic mp = ERUS Du 


sin (u/2) 
Y sin [(2n — 3m) u] = ¥ snl 2n — 3) u — 3(m — 1) u] 
dr x sin (njt/2) sin [3 (n — 1) u/2] 
= pop —3)u—3my] = —— duhmubrp---- 


The position and angle after n turns are 


Tn |  [ Locos (np) + ao sin (nu) à 0 
b n E. sin (nu) + do cos m Pu [s cos (nu) + Go sin "T 


1 sin [(n — 1) u/2] ( sin (nu/2) 
j^ (25 a0) sin (u/2) ie 2) 
"m: 2 sin (n = 1) 1/2) sin (3np/2) 
LE (15 — ab) sin (u/2) [s M 
ə sin [3 (n — 1) n/2] f — sin(nu/2) 
> pe (26 — 26) sin (34/2) | cos (nu/2) 
7 i, nre sin [(n — 1) n/2] [ — cos (3nu/2) 
go Oe ETE (u/2) sin (3np/2) 
E m nre sin [3 (n — 1) u/2] [ cos (nu/2) 
7 D i (3u/2) sin (nu/2) J ' 
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which shows that the position and angle become arbitrarily large when p > 
2nk or 2n(k + 1/3). 

Next, we will study the deformation of the invariant. Since the sextupole 
affect only the nonlinear part of the motion, the perturbation of the invariant 
is of the third order and higher. One way to obtain the new invariant is to find 
a new coordinate system in which the motion is a circle. The new invariant 
can be found via the relation between the new and the old coordinates. This 
method is called the normal form theory. In general, there is no analytical 
solution to the perturbed invariant. We can only obtain it perturbatively. In 
order to demonstrate the essence of the method, the lowest order perturbation 
of the invariant will be derived. Let % and à be the new coordinates such that, 
in this coordinate system, the motion is a circle up to the second order. gine 
the linear motion in the coordinate system of (z,à) is already a circle, the 
general form of the relation between the new and the old coordinate systems 
can be written as 


z 2j d Au? + Artă + Agoda? 

Po = A (0) i = m + z5 n <2 ; 

ü a a Bic T Bıoza E Boga 
where A denote a second order transfer map that transforms (x,a). The 
inverse, to the second order, is 


~ ms = x2 mE x2 
b — Ao B E B = id id Bat 
a a a Bui T Biota d Booa 


= | = Ao Peel ee eee Bod Ces) 
ay a + k,B2z? —2x sin u + a cos p do 


Expanding it to the second order, we have 


D E us cos u + 9 sin y af Ae) EGA RN Eca 
ài —To sin u + áo cos u Zla 22a 23a 
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a NA SS. xg 
To COS” u + Todo sin (25) + Gp sin n) 


122 zz 122 
+ Aig (-58 sin (24) + Todo cos (21) + 3 do sin (2u)) 


? 
ho 


e ~2 
+ Age (žo sin? u — Fao sin (2u) + Gp cos? n) , 


T t ~2 
Zia = Bir (Žo cos? u + Zod sin (24) + [^ sin? n) 


1x2. ses 1z2. 
+ Bio (-5& sin (24) + Zoo cos (21) + 540 sin u)) 


x. 2 za za 2 
+ Boo (žo sin” u — Todo sin (2u) + Gp cos n) ; 


~2 ~x ~2 x2 xx x2 
Z2x -- (Au, + Aj2%9a9 + A22ã0) COS [1 — (Buz + By2%oao + Bačo) sin 4, 


x2 xx x2 x2 cu x2 
26 — (Anto "E Aj2%oao 2 A22ã0) Sın y — (Buz + By2Xoao0 + Basti) COS LL, 


232 > 0, 
3 (x2 9 Pe d GA. 85 
23a = k8? (3; cos” u + Todo sin (21) + Gp sin i) . 


Since the one turn map in the new coordinates is a rotation up to the second 
order, we obtain the following equations 


1 
A11 cos? u — 542 sin (2u) + Ago sin? u — A11 cos u — Bıı sin u = 0, 
A11 sin (21) + A12 cos (24) — As» sin (24) — A12 cos u — Big sin ji = 0, 
1 
A1 sin? u + 742 sin (2u) + A22 cos? u — Age cos pp — Bog sin u = 0, 
1 
Bıı cos? — z722 sin(2u) + Boo sin? + Ay, sin u — By, cos u = —k,B? cos?p, 


Bıı sin(24) + B3 cos(21) — B22 sin(2u)+ A12 sin 1i — B12 cos p = —k, B?sin(2j), 


1 
Bıı sin? u + 5B sin(21) + B22 cos? u + A22 sin u — B22 cos u = —k,B? sin? u. 


After tedious but straightforward algebraic and trigonometric manipulations, 
the solution is obtained, which is 


a cos (41/2) cos u a cos? (1/2) 
Ai = —k, Rusia fea saad et A 20, Ago = —k,83 — 
^h P^ Jain (3u/2) i id P^ sm (31/2) 

l, os a cos (u/2) cos u 
By =—-=k,8?, By = kb? —— ——— ——, 
11 2 B 12 B sin (32) 
As a result, the transformation defines a coordinate system in which the mo- 
tion is a rotation up to the second order. In other words, we have 


xe c 
r +a =€, 


Bog = 0. 
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which take the form 
(Z-- Au? + Agga?)^ + (G+ B? + Bura) =e 


in the old coordinates. Keeping only the terms up to the first order of k,, we 
have 
z? + a? + 244,8? + 2B + 2 (A22 + B2) Ea" = €. 


Since 


| cos? (u/2) , cos (u/2) cos p 
sin (34/2) sin (34/2) 
os (4/2) [cos u — cos? (u/2)] 


Ago + B1» = kG? 


= het sin (3/2) 
u 3cos(u/2)sin? (u/2) _ a sin (4/2) sin u 
EET (31/2) PADRE 2sin(3u/2) ' 


the perturbed invariant is 


a. ums a 2)cosu.4 ~o~  sin(u/2)sinp 
2 2 k, g [cos (u/ 3 2 2) _ 
T +a B? sin (Ga) T LEX EET aBa a €, 
or 
TT ee ae T £ " 
T + d^ — —— T [T cos (u/2) + àsin (u/2)] [3 cos u + à sin ui] = e. 
sin (34/2) 


Apparently the invariant diverge when u — 27/3. The invariant in the above 
equation is shown in Fig. 11.3, which shows the presence of the third-integer 
resonance. As a comparison, the true invariant obtained through tracking 
is shown in Fig. 11.4. It is clear that the invariant obtained from the first 
order perturbation theory agrees with the exact invariant qualitatively but 
not quantitatively. 

Like the half-integer resonance, we can go one step further to include the 
terms that are proportional to k?. Again, we can attempt to find another 
coordinate system in which the motion is a rotation up to k?. But first of all, 
we have to obtain the third order one turn map in the coordinates of (2, à). 
From eq. (11.8), we have, to the third order, 


gi —— oe , A aA To 
üi a + k, 02a? —x sin u + a cos p To 


r + Aya? + A220? x cos u + asin p 
= [9] 

a+ Biz? + Biaza —q sin u + a cos p+ ks8? (a cos u + a sin u)’ 
e z2 za 9 Z3 c2 Ea 
Xo — Aito — A2200 4 2A71 Žo + 2A22By1%9a0 + Ag2By2%0G 
x za Soi c2 25.74 z3 
ao — B11tg — B1200 2A? Too + 2A22Bi1 Fa) + A22 B12üg 
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0.4 F 4 0.4 
0.2 r 4 0.2 
iS gb 4 0 
0.2 } 4 -0.2 
0.4 F J 04 
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FIGURE 11.3: Invariant obtained through first order perturbation theory 
(left) and tracking (right), with k,82 = 1 and v = 130/360. 


lr 1 
0.5 F J 0.5 F 
Is 
° i 
-0.5 F 1 -0.5 } 
1 -0.5 0 0.5 1 -] 
x 


FIGURE 11.4: Invariant obtained through first order perturbation theory 
(left) and tracking (right), with k,2 = 1 and v = 110/360. 


The last line is the function A~! to the third order. It is straightforward to 
verify that it is true using the relation B1 = —2441. Yet Aand AT! are not 
symplectic to the third order, which is easily verified using eq. (5.6). The 
details of obtaining the symplectic version of Aand AT! are beyond the scope 
of this book. The result is actually very simple, which is 


x3 cm =~ 22 

x A12? + A20? + A? To + A22 B11T9ào a A11 A2200 
prs Ac. z3 

a+ Bia? ines 2A411xa “TF A? Todo T A2» B11T9àg ze A11 A224 
2 2 2 Z3 zz zo» 

Xx — Auc = A24 + Aj, Xo + A22 Bi £9 ao m A11 A2200 
pu eo pres z3 

a— By x? + 241124 Te A? Zao UE Ag2 Bi T9 — A11 A229 


d ue 


Again, it is straightforward to verify the symplecticity of the above map using 
eq. (5.6). Note that the third order part in Aand A`! is half of that in the 


288 An Introduction to Beam Physics 


non-symplectic version of A^. 
After a straightforward yet rather lengthy derivation, the final result is 


= = pM z3 pe dos = x z3 
T1 To COS p, F ao SIN H "n Aj11% + A112000 + A122X909 + A222, 
emule S id x ~8 M2 ~~2 3’ 
a1 — To SIN H F Ag COS U Bii + By12%9a0 e By22%d IS B22200 
where 
cos? (u/2) sin u cos? 
A11 = E EAA 
sin (3/2) 
3 (u/2) cos u 
ie E e 5 ect 
112 3B 2 sin (3u/2) [3 cos ( u) ] , 
* (u/2) sin (u/2) 
A122 = k2 gos uem QR) 1 + 3cos (2 
122 = Kj sin 312) [1 + 3cos (24)] , 
4 cos? (u/2) sin? (u/2) cos u 
Agog = k2g3 m ore 11.9 
222 25 sin (3u/2) , ( ) 
and 
Bui = k28? 4cos? (u/2)sin? (u/2)cosu | cos(u/2) cost p 
d. sin (34/2) sin(3u/2) f’ 
* (u/2) sin (1/2) 
Bag = k2 B3 £08 (u/ 1 ; (9 
112 = kgb { sin (3/2) [1 + 3cos (2y)| 
622 (u/2) cos? (u/2) cos? u 
sin (34/2) f 
3 (u/2) cos u 
Biss = k2 83 cos" (1/2) cos y 2u)—1 
1an = R38 [ST EISE toos) — 1) 
n2 3 2 
1222 Wife) COP (u/2) cos? u 
sin (34/2) 
Bon = k28? cos? (1/2) sin pcos? u cos (4/2) cos usin? p (11.10) 
AE sin (31/2) sin (34/2) i i 


The next step is to find a second order transformation (in terms of ks) that 
the third order map in the newest coordinates is a rotation. Unlike the first 
order transformation, not all nonlinear terms in the map can be removed even 
if u does not satisfy any resonance condition (i.e., v is irrational.). It is much 
easier to illustrate this in the eigenspace of the linear map, which is complex. 
Let us denote (s*,s~) as the complex coordinates which are related to the 
real coordinates by the relations 


sT 1/1 i\(z 
PETENS 
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AEN e 


It is obvious, in this case, that the map in the complex coordinates is sym- 
plectic as long as that in the real coordinates is, since the determinant of the 
Jacobian in the complex coordinates equals that in the real coordinates, which 
equals to 1. The linear one turn map in the complex coordinates is 


D EE 2) 0m 


For the time being, let us consider the generic case that the map M contains a 
linear part R and a nonlinear part S. In the complex coordinates, R is shown 
above. Furthermore, let us assume that the lowest order terms in S are of 
that of m. Note that if m > 2, M is the map that has been transformed 
through the nonlinear normal form transformations at least once. Now let us 
consider M only up to the mth order, i.e., 


The inverse is 


Mm = R+Sm. 
Define the coordinate transformation as 
Am — WA + Tari 


where Z is the unity map and Tm contains terms of the mth order only. Hence, 
to the mth order, 
An =L- Tm: 


The normalized map, to the mth order, is 


Nm =m Am © Mm o Am =m (LET) ORAS) OL = Tan) 
=m (L+Tm)°(R+Sm — R o Tm) =m R+ Sm — (Tn oR - R o Tm). 


The goal is to cancel as many terms in Sm as possible. Let us evaluate the 
map Tm oR — Ro Tm, which is 


TImoR-RoTm 
bs o i ( "sp j- ( in ; (i qu. M] 
e INDE Co) amy A Jes rupe up) 
M Tran ls)" (e$) [ete ec m] 
> a (ss) (st) [em — ei] 


Apparently terms in Sm cannot be removed if the corresponding terms in the 
map Tm o R — R o Tm are zero, and they are zero if 


gii (m—2k) — ett — 0. 
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That is 


ett (M—2kF1) = 


, 


which is fulfilled when 
ii (m — 2k xz 1) = 2a, 


where n is an integer. Apparently, the solutions of this equation can be divided 
into two classes: one that is independent of u, which is k = (m F 1)/2, and 
the other that depends on p. The solutions that depend on u are called the 
resonance conditions, which can be avoided with the choice of p. The ones that 
are independent of u provide the terms that cannot be removed from the map 
regardless of the choice of the tune. First, let us consider the case of m — 2. 
Since both m and 2k are even, under no circumstance m — 2k = 1 = 0. This 
is consistent with the fact that a solution was found above that transforms 
the second order map into a linear map. For the u dependent solutions, it 
is straightforward to verify that all six solutions are included in one simple 
relation, which is 3u = 2zn. Note that this is none other than the condition 
of the third-integer resonance. Second, we consider the case of m = 3. There 
are two solutions that are independent of u, which are k = 2 for the top row 
and k — 1 for the bottom row. Again, all eight solutions that are dependent 
on p are included in the expression 4u = 27, which is the condition of the 
fourth-integer resonance. In case of the sextupole, it drives the third—integer 
resonance to the first order of k, and the fourth-integer resonance to the 
second order of ks. Since we usually set the tune away from the third and 
the fourth-integer resonances, we can obtain a third order map that takes the 


form of 
nem | gar + S3 (sg) | . 
€^ sj + 351 (so) (so) 
Let us focus on the new map, since we will not try to obtain the second order 
distortion of the invariant. Going one step further, we have 


Nes | [e H+ S3350 s] So ) 


[e + S555 sd] sd 


Transforming back to the real coordinates, we obtain 


m -( Zp cos p + da M Fo + d | (S32 + S4) o (85, — 22 
4 \=i(S52 — 831) To + (S32 + S31) do 
(11.14) 
As a result, we conclude that 5554-53, is real and 85; — 93, is purely imaginary. 
In order to make clear the meaning of the third order terms in N3, let us first 
show that M3 is symplectic to the third order. First recall that 


—&o sin u + do cos u 


N3 =3 Aa 0 Az o M o A5! o Aj’. 
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Defining A23 =3 A3 o A2, we have 
N3 =3 A230 M2 o Aj, 


When carrying out the transformations, we usually make sure that A23 is 
symplectic up to the third order. Therefore, we have 


MS A A A A fa PES d P eats A AA qv A 
Ñ Ĵ ÑE = Aso] (Ass M245} ) => dM d (433) MZ A, 
=2 Áos M3J MT Al, =2 Ao3 J AZ, =2 J: 


Now that we have shown that M3 is symplectic up to the third order, we 
can find out the relations between the terms. The Jacobian of M; can be 
written as 


=R+ $3, 


Lu AA IE, —42 
R= Ji : and $5 = S3250 0 Ssa Vs (s 0) : 
ee S31 (sg) 33180 80 


The symplectic condition becomes 


where 


(Ê n 83) J (a +93 7) zo 
which leads to the relation 
&,/ RT + RISE = 


Plugging in the matrices R and $5, we have 
S325050 939 (So J? 0 1 e ^ 0 
Sj; CN S3150 50 cp 0 e 


(c* 0 0 1\ Szsged SH (st)? A 
MEN =i 0/ S; CIN S3150 50 


which can be simplified to 


; 1 
(Sze + Sie) so sg e ;) =0. 
Defining 
Ts = iS3,e™, 


we obtain 
Sz = —iTse ^", Sj =iTse™ 
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Furthermore, we obtain that 
Sz Sj; — -2Tssinu, S32 — Sj, = —2iTs cos p. 
Hence we arrive at the conclusion that Ts is real. Therefore, we have 


M p [1 — iTssg sd | >) 


e" [1 + iT 389 sg | st 


To the first order of Ts, N3 can be expressed as 


ARS = [-i (u + Ts8q s¢)] sg | 


exp[ i(u- Tss so )] so 


It is clear by far that the remaining terms in the normalized map N3 contribute 
to the change of the tune only. It is worth noting that the change of the tune 
is a function of the invariant, which is sometimes called the tune shift with 
amplitude. Computationally, the above described procedure can be easily 
carried out using the Differential Algebraic (DA) technique, which is valid for 
any given order. 

Now the tune shift with amplitude can be determined through the relations 
between the coefficients in the real and the complex coordinates. Repeating 
eq. (11.13) to the third order, using eqs. (11.11) and (11.12), we have 


[E "il jJ | COS u 2 | 1 a e 

st 2\1 =i — sinu cosp —i i st 
D)? + Ane (s0 + 5$) (~is + ist) 

)? + Bus (89 + sd) (—is3 a 

5) (iso + isf)” + A222 (ig + isi n 
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It is Uo to extract the coefficients $3, and S4, which are 


+s 


L [a (Aiii +iB111)— i (A112 +iB112)+ (A122 +iB122)— 30 (A222 +iB222)], 


[3 (A111 —£Bq311) i (4112 — 4B112) + (A122 —i B122) 4-310 (A222 — i B222)]. 
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Furthermore, we have 


a(S T S31) ==( 3Ani + B112 + A22 + 3Bo22), 
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A 1 (9: - 531) 


From eqs. (11.9) and (11.10), we obtain 


(—3B111 + 4112 — B122 + 34222). 
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1 59-53) = 7g x sin (34/2) ` 


As a result, eq. (11.14) becomes 
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where Aj = — (3/8) - k28? - (cos (u/2) cos u/ sin (3u/2)) - (To + ão). Note that 
Ap is proportional to To + Bo which is an invariant of motion. Note that 
the distortion of the invariant of motion is proportional to k. Hence the tune 
shift is small compared to the distortion of the invariant. The result is that 
the third-integer resonance usually leads to arbitrary large distortion of the 
invariant. For higher order resonances, the distortion is either of the same 


order or smaller than the tune shift. The resonances are, therefore, confined 
in the phase space. 
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